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WESLEY CLAIR MITCHELL: THE ECONOMIC SCIENTIST* 


Avotr A. BERLE, JR. 
Columbia University 


HE legacy of a scholar is his teaching. The tribute to a scholar is 

the esteem of his peers. The immortality of a scholar is the influence 
his thinking exerts on generations yet to come. Wesley Clair Mitchell 
can fairly be named the greatest American economic scholar of the 
twentieth century. Laying a foundation for the evolution of scientific 
economics was his precise and monumental contribution. Why it was 
done at all and how it was achieved is a subject of first importance. Be- 
cause of Mitchell’s work, economics can fairly claim now to be called 
a “science,” just as, for lack of a similar contribution in political science 
and sociology, these sisters of economics still grope in the gray fog that 
lies between the area of speculation and the explored territory of gen- 
eralization and conclusion from observed facts. 

Colleagues, disciples and students bear affectionate and discerning 
witness in this volume to his achievement. It is fitting that they have 
done so because of their relationship with the man; but, more impor- 
tant, it is useful that they have done so because the book is a case- 
study of the process by which economics was translated from a kind of 
theology into a more or less exact science. The essays here collected by 
Arthur F. Burns thus are far more than a memorial; they are a con- 
tribution to the study of economic methodology, and cannot fail to be 
- causative. 

Before Mitchell tackled the field of economic method, economics 
consisted of speculative development of hypotheses formulated by 
earlier, intelligent observers of their environment. These attempted 
generalization from their limited and personal observation. Such was 
the work of Adam Smith, of Ricardo, of John Stuart Mill, and later of 
Marshall and of Taussig. Their speculations opened a wide door (of 





* A review of Wesley Clair Mitchell; The Economic Scientist, edited by Arthur F. Burns, (New York: 
National Bureau of Economic Research, Inc., 1952). 
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this, more later). Without them, no science could ever have been con- 
structed; but acceptance of their hypotheses without demonstrating 
their factual base clearly left economics in the realm of inductive phi- 
losophy rather than of scientific demonstration. It remained for 
Mitchell to pioneer the job of isolating identifiable economic factors, 
determining analytic definitions of phenomena, working out methods of 
accurate measurement and continuing observation, verifying relation- 
ships or at least establishing probability of a relationship of cause and 
effect between phenomena. Classification, description, measurement, 
establishment of interrelationships, generalization, looking toward pre- 
diction, this in economics as elsewhere is the basis of scientific method. 
As a young man, Mitchell was unsatisfied by speculation without proof, 
and he accordingly introduced scientific method into economics with 
modesty and integrity, combined with a good humor which made him 
at once the most beloved as well as the most respected of contemporary 
social scientists. 

How he did it is partly a saga of life, partly a demonstration of close 
thinking. In one sense, he never transcended an early conception 
blocked out when he was studying at the University of Chicago. There 
he was working on that grimmest of assignments, a Ph.D. thesis, de- 
signed to be “A History of the Legal Tender Acts.” But it evolved into 
“History of the Greenbacks,” and became eventually a powerful analy- 
sis of monetary theory in the United States. His intellectual godfathers 
in that period were surely diverse: Thorstein Veblen, brilliant, tor- 
mented, biting; John Dewey, challenging conventional process of 
thought; Jacques Loeb, psychologist, and physiologist, devoted with- 
out limit to scientific method. This constellation was something for 
students to dream of: a brilliant analytic mind, a sceptical but human 
philosophical mind, and a rigorous scientific methodological mind. 
Mitchell absorbed wisdom from all three ; and an economist was formed. 
In unpredictable result, the concept of a true science of economics was 
really generated through Mitchell by a tempestuous dreamer, a revolu- 
tionary philosopher and a pessimistic physiologist. 

The consequence of this amazing combination of stimuli (carefully 
analyzed in an essay by Professor Frederick C. Mills) was Mitchell’s 
application of methods known in physical science in the form of a mas- 
sive monograph on “Business Cycles.” It was magnificently and ac- 
curately factual—a description rather than an explanation. At the 
close of his life, Mitchell was still working on business cycles. He still 
considered that he had not solved the problem. But as a result, the 
“business cycle” had been identified if not explained. The National 
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Bureau of Economic Research, in whose organization Mitchell had 
played the leading part, had made notable progress in the vast task of 
analyzing, describing, and collecting accurate data in sequence. Meas- 
urement and actual calculations of “national income” and of “gross na- 
tional product,” the analysis of the rise and fall of price levels, all stem 
from this life-time work. “The notion that inquiries should be framed 
from the start in such a way as to permit of testing the hypothetical 
conclusions,” the notion of sequence, the concept of consecutive 
growth as opposed to older ideas of assumed but unexplained (and ac- 
tually non-existent) equilibrium, are perhaps the most causative of 
Mitchell’s ideas. Hundreds of men are working now on lines of investi- 
gation opened up by them. 

The bibliography of his work, beginning with “The Quantity Theory 
of the Value of Money” (1896) and closing with “What Happens dur- 
ing Business Cycles: A Progress Report” (1951), is a catalogue of hun- 
dreds of studies, large and small, of exact, factual data, so handled that 
each piece of measurement or description could be related to and used 
in connection with other collections of fact-data. Inspired by his in- 
fluence, a host of men have since followed his example. At the close of 
his life, great areas of economic theory have been or can be tested by 
reference to relevant facts. A demonstrated conclusion can be dis- 
tinguished from unverified theory. Economics has moved steadily for- 
ward; it is fairly on its way. 

The essays here collected show all this, though each naturally follows 
the turn of its respective writer. Mills, perhaps chiefest heir of Mitch- 
ell’s thought, places his professional sketch next to a tender and pene- 
trating memoir written by Lucy Sprague Mitchell. The two are not as 
far apart as might be supposed. Research which tests dogma against 
fact is always controversial and in some measure impersonally destruc- 
tive and it may even be revolutionary. An honest man may find him- 
self intellectually bound to refute by his facts the most tenaciously held 
doctrine, and is not popular when he does so. A true scientific scholar 
must set his lance above mischance and ride the barriére. The profes- 
sional disciple and the wife find much common ground in their descrip- 
tions. 

Mitchell was always working on two levels: thus, in 1920 he was 
founding the National Bureau of Economic Research with the imper- 
sonal scientific end of measuring magnitude of national income and 
measuring its principal components. But in the same year he joined 
James Harvey Robinson and Alvin Johnson in founding “The New 
School for Social Research” as a part of his life-long protest against 
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restricting freedom of speech in academic institutions and somewhat, 
later he was endeavoring to salvage the remaining years of Thorstein 
Veblen from the dark limbo into which Veblen’s stormy soul had 
steered him. Mitchell gave the best of his thought to governmental re- 
search and planning, which necessarily exacts a high degree of organ- 
ized conformity. But to the end of his life he worked at experimental 
education—which is anything but conformist. To say of him, as does 
John Maurice Clark in his reprinted Memorial address, “American 
economics has lost its undisputed first citizen,” is a plain statement of 
fact. 

Method, well discussed by A. B. Wolfe, is worth a word. We shall 
always have with us the academic individual who fears to make a hy- 
pothesis or propound a theory and confines himself to the less contro- 
versial task of measuring phenomena. But the fact is that measuring 
phenomena without some hypothesis to determine what facts to collect 
is sterile business. Even a fact-collector must have a hypothesis—else 
a count of paving stones in Times Square would be just as significant 
as calculating the size and distribution of the national debt. Mitchell 
never fell into the error of first accumulating facts and then construct- 
ing a hypothesis. He knew better than most that the hypothesis guides 
the collection of facts; but that from the collected facts, the hypothesis 
can be tested, revised, or discarded and new hypotheses can be made. 
On the other hand, he never committed to any body of theory, and 
thereby provoked a major debate in certain quarters. 

In the one critical essay included in the volume, Paul T. Homan ob- 
serves that, as a sequel of Mitchell’s method, increasingly sound scien- 
tific work is being devoted to detailed studies, disclosing the manner 
of the workings of portions of the economic system “to the ultimate end 
of assisting in intelligent social guidance. Realism is Allah, and Mitchell 
is his prophet. Paradise may be around the corner.” Unless, suggests 
Homan, you commit to some sort of theory, you are lost. Implying that 
Mitchell did not commit himself, he ironically adds that there is no 
scientific reason to suppose that the “voluntary process of cumulative 
causation is amenable to intelligent social control. . . . There is no guide 
here but faith.” Which is another way of saying that value judgment of 
Mitchell’s scientific economics depends on philosophical premises; that 
Mitchell had one but did not state it and could not test it; that for all 
his realist detachment, Mitchell’s work was really energized by quite 
homely, non-scientific convictions, and that he might as well have 
stated them as theory. 

The implicit criticism is, of course, that without a comprehensive 
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theory which Mitchell never embraced and never formulated, the sig- 
nificance of his work can not be appraised. This criticism is fair enough 
though not too damaging. Actually, as a result of Mitchell, any eco- 
nomic theory, formulated or hereafter formulated, will be held valid 
only if verified by observable data, and testing for some time to come 
will probably be by use of the statistical tools Mitchell forged or by 
methods he developed. Development of method was Mitchell’s chosen 
function; fundamentally he was a master toolmaker. One suspects he 
also cherished (by inheritance from Veblen) a hidden and unfulfilled 
desire to write a great Utopia, but consciously limited himself to sta- 
tistical work lest he become intoxicated with his own dreams. But— 
men are led by their dreams and their dreamers, as politicians know all 
too well. Statistical analysts can merely record the results of their pur- 
suit, in hope that succeeding generations may perhaps be better guided 
in selecting what manner of dreams and dreamers to follow. 

If Mitchell’s name is attached to anything, it will be to a school of 
economic thought now known as “institutionalism.” In Wolfe’s words, 
“Economics must be a concrete and realistic study of institutional 
habits and relations, and these must be regarded not as fixed but as 
continually changing; hence economics is not only an institutional but 
a dynamic and evolutionary science.” To this is linked another apho- 
rism that economics has only one justification—the furtherance of eco- 
nomic welfare. Men, en masse, behave in certain standard ways, accord- 
ing to uniform though complex patterns. These patterns are in constant 
evolution, but in part at least they can be measured by statistics. Pat- 
terns of human action tend to be reflected in institutions or persistent 
groupings; they include, for instance, the American corporation (the 
connection in which this reviewer first met Wesley Mitchell) or the 
habits and framework of consumer credit or possibly a Communist 
commissariat or twenty other similar institutional operations. These 
institutional patterns become, in themselves, factors of greater or less 
force in economic action. All this transcends what was commonly 
known as “orthodox” economic theory. (Milton Friedman makes this 
point with discernment in his chapter on “The Economic Theorist.”) 
It accounts for Mitchell’s frank recognition that the money economy 
(commonly miscalled “capitalist” economy) is by no means the only 
possible system in which economic principles have validity, or in which 
economic science can exist. 

*x* * * 

Because the chief authors of these essays are economists, any lay 

reviewer must submit his comments with diffidence. No criticism of 
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Mitchell is implied in the observation that his insistence on demon- 
strated facts isolated him from some of the driving conflicts of his time. 
Facts are few and hard come by; to insist on them before asserting a 
viewpoint does accomplish detachment. Yet detachment is, and in 
Mitchell’s case was, bought at a price, whose extent is just beginning 
to be visible. If, for example, the dominant motive activating human 
patterns ceases to be individual search for profit, and becomes rather a 
reaching for power, individually or through collective groups, much of 
current economic theory would have to be revised and statistical con- 
clusions would assume different significance. Only in 1952 an econo- 
mist’s first attempt at reappraising the American capitalist system in 
this sense has reached print in Kenneth Galbraita’s American Capital- 
ism: The Concept of Power. The thesis would not be undreamed of in 
Mitchell’s philosophy—but it would be excluded from his economic 
work. 

Again, certain bench marks sacred to economists, such as the “basic 
interest rate,” may be a result of inevitable tides; or they may be the 
result of more or less consciously managed monetary systems. Indi- 
vidual search for security and comfort, indeed, may perhaps give way 
under some circumstances to a mass desire, or at least willingness, to be 
dedicated or subjected to directed collectivism, as for example, in the 
Soviet Union. It is quite conceivable that as modern technology pro- 
gresses, the power motive rather than the profit motive may become 
the key to institutional development. It is at least possible that the 
titanic world struggle of today arises from the tensions created by some 
such shift. It could be, in a word, that economics apart from political 
science is meaningless abstraction. Detachment from political struggle 
may not be a luxury permitted to economists in days to come. Mitchell 
never ignored such struggles; but he never mixed his thoughts about 
them with his economic work. 

In like manner, Mitchell’s insistence that measures looking toward 
social reorganization must be based on “established know. _. -e” is 
merely Utopian. Statesmanship usually has to meet crises arising out 
of the unknown or unexplained. Historians may later supply the “es- 
tablished knowledge” needed for full diagnosis; but the statesman, 
businessman, or politician rarely has the luxury of awaiting that happy 
day. He must act in accordance with someone’s best estimate of prob- 
abilities, and the event will prove him right or wrong. For an econo- 
mist to refuse to estimate in these circumstances is scientifically credit- 
able, but it deprives the politician of the putative best estimate. 


* * * 
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For laymen, much of the economist’s language is almost unintelligi- 
ble. But they understood Mitchell. Most educated people today have 
some idea of the national income of the United States and of its fluctua- 
tions, and the national government uses his statistical instrument as a 
chief factor in determining fiscal policy. Laymen can and do make use 
of the growing body of data covering its use and its distribution. Busi- 
nessmen as well as economists have a gradually growing knowledge of 
the relation of money and credit to the rise and fall of employment and 
trade, and to the creation and distribution of goods and services. 
Thousands of investors who never heard of Mitchell know about the 
business cycle, and seek to anticipate it in conducting their affairs. 
Hundreds of bankers have unostentatiously changed their financial 
theory because of his critique of the quantity theory of money. Impact 
of his ideas is found all over the American governmental and business 
system; it is no light recognition of his accomplishment. 

Yet even today Mitchell’s greatest premise has not been fully under- 
stood. Nearly thirty years ago he indicated it: 

In becoming consciously a science of human behavior, economics will lay 
less stress upon wealth and more stress upon welfare. Welfare will mean 


not merely abundant supply of serviceable goods but also a satisfactory 
working life filled with interesting activities. ... 


He dreamed of developing “criteria of welfare” as well as of statistical 


measures for income and wealth. In his view, this science of economics, 
struggling to be born, was but one tool in the greater search for life— 
however conceived. 

Mitchell’s great contemporary, Professor Joseph Schumpeter of 
Harvard, a few days before he also joined Mitchell in the Valhalla of 
scholars, wrote in closing the record: 

Here was a man who had the courage to say, unlike the rest of us, that 
he had not all the answers; who went about his task without either haste 
or rest; who did not care to march along with flags and brass bands; who 
was full of sympathy with mankind’s fate, yet kept aloof from the market 
place; who taught us, by example and not by phrase, what a scholar should 
be. 








THE VELOCITY OF TIME DEPOSITS* 


Grorce GaRvy 
Federal Reserve Bank of New York 


HE rate of turnover of time deposits belongs to the outer reaches of 

the terra incognita of the behavior of liquid assets. “Liquid assets” 
and not “money” is intentionally used because the question whether 
time deposits are part of the money supply is one on which there are 
differences of opinion among monetary theorists.! Even within the 
Federal Reserve System there is no unanimity: while the Board of 
Governors includes in the money supply not only time deposits of com- 
mercial banks, but also deposits of mutual savings banks and of the 
Postal Savings System, the Federal Reserve Bank of New York does 
not include any of these items.” 

Several writers on monetary problems have inquired into the relative 
velocities of circulation of various components of the money supply, 
but only after making their own decisions on the proper definition of 
money.’ Their explorations, moreover, generally involve certain as- 
sumptions as to the velocity (and other dimensions) of one or several 
components of the total money supply for which empirical data are not 
readily available. Such assumptions with respect to time deposits range 
from Keynes’ view that savings deposits have a “velocity of zero” to 
Burgess’ more generally used estimate of a rate of turnover of twice a 
yea..* Most recent writers either confine themselves tc the statement 
that the rate of turnover of time deposits at commercial banks is low 
or very low, or fall back on Burgess’ estimate. 

It is not intended to review here the controversy on the “proper” 
definition of the money supply. Indeed, a meeting of minds is more 
likely to be advanced by an inquiry into the behavior of time deposits 





* This paper is based, in part, on a report prepared for a technical committee of the Federal Reserve 
System. 

1 See, for instance, A. Marget’s exhaustive discussion, in The Theory of Prices, Vol. I, Ch. XVI. 

2 See, for instance, the article “Money Supply” in the Monthly Review of Credit and Business Condi- 
tions of the Federal Reserve Bank of New York, Nov. 1951. 

3 See Marget, op. cit., Vol. I, p. 463, footnote 10. 

4 W. R. Burgess, “Velocity of bank deposits,” Journal of the American Statistical Association, 1923, 
pp. 727-40. Burgess’ estimate, however, was based on fragmentary data obtained in 1922 from only 
six New York City banks for short periods, usually one month in each case. The validity of his gen- 
eralization of this rate for the country as a whole may be challenged on the basis of data which have be- 
come available subsequently and which show that time deposits in other parts of the country turn over 
less rapidly than in New York City. 

5 See, for instance, L. Currie, The Supply and Control of Money in the United States (Harvard Uni- 
versity Press, 1934), p. 50. Angell does not inquire into the turnover of time deposits because “there 
appears to be no data for measuring their rate of turnover in the past.” See The Behavior of Money 
(McGraw-Hill Book Co., 1936), p. 94. 
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than by further exploration of the logical possibilities of defining attri- 
butes of money. If it is agreed that some types of liquid assets, such as 
currency and demand deposits, are money par excellence, the approach 
might be to inquire whether time deposits (and other classes of liquid 
assets) show similar velocity and similar cyclical and seasonal patterns.® 
If business firms and consumers consider time deposits as equivalent to 

ioney, the behavior of time deposits should bear a close resemblance 

) that of currency and demand deposits. For the present study, the 
behavior of demand deposits will serve as a convenient frame of refer- 
ence since comprehensive data on the velocity and patterns of use of 
currency are lacking. 

Because of the paucity of data for commercial bank time deposits, 
the scope of this inquiry has been broadened to include savings institu- 
tions. While the great bulk of deposits of savings institutions consists of 
savings deposits of individuals, time deposits of commercial banks’ also 
include time deposits of their trust departments, of business firms, and 
of government units. Such deposits are frequently held in the form of 
certificates of deposit or in open accounts. The shift in the composition 
of time deposits accounts, as will be shown below, for the divergent 
movement of the rate of turnover of time deposits and of savings de- 
posits. 

Data on the turnover of total time deposits at commercial banks are 
available for only a short period prior to World War II. They have been 


collected from about 400 “money market” banks (weekly reporting 
member banks) in New York City and 100 other leading cities for the 
relatively short period from September 5, 1934 to February 1, 1939. 
Data on the velocity of savings deposits only, were collected, on an 
annual basis only, by the Savings Division of the American Bankers’ 
Association from about 140 scattered commercial banks for the period 
1940-47; similar rates for 1949-51 were obtained from more than 800 





6 It should be clear from the above that the concern here is with the “moneyness” of time deposits 
rather than with other aspects of the problem, such as the fact that time deposits provide reserves for 
credit expansion or that it is difficult to separate the two types of deposits. These two points have been 
used by Goldenweiser (in his recent American Monetary Policy) as arguments for including time deposits 
with the money supply. 

The fluidity of the concept of money has been increased by the war-time growth of liquid assets 
possessing a high degree of convertibility and attaining annual turnover rates similar—as a matter of 
fact, identical—to those of time deposits. Thus, Series E Savings bonds, after being held for at least 
sixty days, now are as easily and conveniently converted into cash as time deposits. Furthermore, while 
legally a 30-day notice can be enforced against holders of savings accounts, Savings bonds are payable 
on demand. Obviously, all types of liquid assets possess some degree of “moneyness.” Marget’s argu- 
ment that all deposits have some rate of turnover—be it only once every 1,000 years—may be extended 
to include all types of liquid assets. 

7 U. 8. Government and interbank time deposits are always excluded from the term “time deposits” 
as used in this paper. 
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commercial banks. These data can be supplemented by time series for 
the turnover of deposits at mutual savings banks, savings and loan 
associations, and the Postal Savings System which can be computed for 
periods of various lengths. In addition, several special surveys con- 
ducted by the Savings Banks Trust Company help to shed additional 
light on the behavior of savings deposits. 


THE RELATIVE IMPORTANCE OF SAVINGS DEPOSITS IN 
COMMERCIAL BANK TIME DEPOSITS 


The justification for drawing limited inference from the behavior of 
savings deposits to the behavior of total time deposits in commercial 
banks lies in the fact that the share of the other types of time deposits 
in total time deposits is known to have been small and declining. The 
earliest available data on the proportion of savings deposits in time 
deposits are limited to national banks, fewer than half of which re- 
ported savings accounts separately for June 30, 1910. In these banks, 
savings deposits accounted for about 57 per cent of a derived total, 
“time deposits” ;* by June 1921, the percentage of savings deposits and 
open accounts combined (no separate figures being available) was over 
73 per cent, and by June 30, 1926, over 79 per cent. Time certificates of 
deposit, which in 1917 accounted for nearly 40 per cent of all time de- 
posits, in 1926 represented only 21 and in 1929 16.5 per cent. 


In subsequent years, the importance of certificates of deposit and of 
open accounts declined not only relatively, but also absolutely, as 
shown in the following table for all member banks of the Federal Re- 
serve System: ' 








Time 
certificates 
of deposit 


Open Total time 
accounts* deposits 


Savings 


Call date deposits 





(In millions of dollars) 
December 31, 1928 9,810 1,895 1,089 12,794 
June 30, 1942 (latest 
date available) 10,357 566 749 11,673 





* Including small amounts in Christmas savings and similar accounts. 
Source: Board of Governors of the Federal Reserve System, Member Bank Call Reports. 





8 Prior to the Federal Reserve Act, national banking legislation did not specifically distinguish be- 
tween time and demand deposits. Since the classification of deposits reported to the Comptroller of the 
Currency was left entirely to the judgment of the reporting banks (see Annual Report of the Comptroller 
of the Currency for 1910, p. 7), and because of incomplete coverage, the percentage for 1910 is hardly 
more than an approximation. 

After 1921, the classification “savings deposits” had become identical with the subtotal “time de- 
posits other than certificates” and was reported by all national banks having such deposits. 

All time deposits, used here, exclude State and municipal and Postal Savings deposits. 
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By June 30, 1945 (the latest date for which such data are available 
for all national banks), the percentage that savings deposits represented 
of all time deposits of individuals, partnerships, and corporations at 
national banks had risen to 95.9 per cent. It was slightly lower— 94.0 
per cent—for all commercial banks largely because of the inclusion of 
State trust companies. 

Although no national data are available for the postwar years, those 
for State commercial banks and trust companies in New York State 
indicate that the prewar decline of time deposits other than savings 
deposits is still continuing. Even though the accelerated growth of the 
relative importance of savings deposits during the war years was re- 
versed in the first postwar years, at the end of 1951 savings deposits 
represented a higher percentage of total time deposits than at the end 
of 1940. This was true for banks located outside New York City (where 
the ratio in earlier years was close to the national average) as well as for 
those in New York City (where it was much lower than for banks out- 
side New York City because of the importance of trust department and 
other types of open accounts): 


Savings Deposits as a Percentage of Total Time Deposits of 
Individuals, Partnerships, and Corporations 








End of New York City Rest of New York State 





1940 
1945 
1951 





Source: New York State Banking Department. 


Thus, the long-run decline of the relative importance of time deposits 
other than savings deposits is apparently still continuing. Outside New 
York City (and possibly outside Chicago and one or two other money 
centers) time deposits other than savings accounts are, however, al- 
ready reduced to such a small proportion of the total that their further 
decline is unlikely to have 1uch effect on the velocity of total time 
deposits. 


TURNOVER RATES OF TIME AND SAVINGS DEPOSITS 


During the years immediately preceding World War II, time deposits 
in weekly reporting member banks outside New York City turned over 
from 0.66 to 0.82 times a year (Annual averages; for monthly data, see 
Chart I). However, time deposits at commercial banks include some 
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deposits of business firms, local governments, and trust departments, 
which tend to be more active than savings deposits. Since the weekly 
reporting banks include essentially money market banks with a rela- 
tively high proportion of such deposits, it is likely that the velocity of 


Cuart I. Annual Rates of Turnover of Time Deposits* in Weekly Reporting 
Member Banks Outside New York City, and of Regular Depositst in New York 
State Mutual Savings Banks. September 1934 through January 1939. 
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* Total time deposits except interbank bet.7een September 1934 and January 1938; thereafter time 
deposits of individuals, corporations, etc., States and political subdivisions. See footnote 6, page 177. 

t Including school savings accounts. 

Source: Board of Governors of Federal Reserve System, and Savings Banks Trust Co. 


time deposits at all commercial banks outside New York City was 
lower than the rates computed from figures reported by these money 
market banks. 

For the same reason, the velocity of time deposits in New York City 
(where, in recent years, savings deposits may have accounted for as 
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little as 50 per cent of time deposits) ranged between 1.65 and 1.99.° In 
comparison, demand deposits at the weekly reporting member banks 
in New York City turned over about forty times as rapidly as time de- 
posits.*° 

A detailed analysis of these data was made at the Board of Governors 
of the Federal Reserve System only for the single year 1936, when the 
annual rates of turnover ranged between 0.5 (Philadelphia District and 
New York District outside New York City) and 1.1 (San Francisco 
District). In New York City, time deposits were more active, but still 
turned over less than twice (1.4 times) a year as compared with 0.8 time 
for all 100 outside centers. The higher turnover rates in New York City 
were traced to relatively active time deposits of trust departments of 
some banks and in the San Francisco District to a large proportion of 
rather active local government funds. It was also suggested that report- 
ing (contrary to instructions) of renewals of maturing certificates of 
deposit as debits had likely resulted in overstating the velocity of time 
deposits, especially in the Boston and Minneapolis Districts (where 
certificates of deposit are more important than in other districts). The 
reporting of transfers of time deposits to demand deposits is likely to 
have had the same effect." 

The results of the ABA surveys are consistent with the rates based 
on Federal Reserve System studies. They show that annual rates of 


turnover of savings deposits at cooperating commercial banks ranged 
in 1940-51 between 0.45 and 0.60;” the rapid growth of savings de- 
posits which have more than doubled during this period may be re- 
garded as a sufficient explanation for the slight downward drift in the 
average rate of turnover over the twelve-year period. 





® The rates of turnover of time deposits in weekly reporting New York City banks reflect primarily 
activity of time accounts other than personal savings accounts. Their monthly fluctuations during the 
years 1935-39 were large and did not display the seasonal pattern shown by outside reporting centers or 
by savings banks. Increases in the rate of time deposit turnover of 50 or more per cent from one month 
to another were not infrequent. Such fluctuations most likely reflect unusual withdrawals from cor- 
porate, municipal government, and trust department time deposits. 

While the rate of turnover of time deposits in banks outside New York City drifted downward dur- 
ing the period under review, the corresponding rate for New York banks rose. 

10 On the turnover of demand and of total deposits, see George Garvy, The Development of Bank 
Debits and Clearings and Their Use in Economic Analysis (Washington, 1952), Ch. VII. 

1 Turnover rates for individual banks were studied in the Boston District only. Wide variations 
were found, partly reflecting differences in reporting and classification practices of banks. 

12 A careful inspection of rates of turnover for individual banks in the ABA sample for 1940-47 
reveals that in most banks, these rates were within the range of 0.3 and 0.6, which may be considered 
narrow in view of the differences in the structure and mobility of population, banking facilities, account- 
ing practices, age of the savings departments, and other pertinent factors. 

While institutions with very large savings deposits generally have higher turnover rates than those 
which hold a smaller volume, there are exceptions. As a matter of fact, some banks with upward of 
$100,000,000 of savings deposits ranked among those with the lowest turnover rates. 
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Annual Raie of Turnover of Savings Deposits Savings 
Departments in Commercial Banks* 








Year Year 





1940 0.53 1946 
1941 0.58 1947 
1942 0.54 1948 
1943 0.49 1949 
1944 0.47 1950 
1945 0.49 1951 





* The survey for 1940-1947, conducted in 1948, covered 140 banks; the survey for 1949-51, made in 
1952, included 808 banks. 
Source: American Bankers’ Association. 

In contrast, deposits of mutual savings banks have been turning over 
since the end of World War II, on the average, once every four years; 
rates of turnover of shares of savings and loan associations since 1945 
have been of the same magnitude (see Chart II). These velocity ratios 
are about half as large as those of savings deposits at commercial banks 
according to the ABA surveys. The principal reason for these differ- 
ences in turnover rates lies in the broader coverage of commercial bank 
time deposits, and in institutional differences between the several types 
of savings institutions. They are, however, not of sufficient magnitude 
to preclude drawing some inference as to the behavior of turnover rates 
of time deposits prior to 1934 from data for mutual savings banks. 
Such data can be computed for New York State institutions for more 
than half a century." | 

Since 1900, the annual rate of turnover of New York State mutual 
savings bank deposits has Iuctuated within the relatively narrow range 
of 0.24 and 0.34, with the exception of 1929 (when it was 0.40) and most 
war years (when it was somewhat lower). During the twenties, it 
fluctuated at a higher average level than during the first two decades 
of the century or the years since 1934. In recent years, the velocity of 
savings deposits in New York savings banks has been still considerably 
lower than in the twenties, although somewhat higher than during 
World War II years. It reflects clearly the speculative peak of 1929 
(when the rate of turnover rose to a level of one-third above that of the 
preceding years), the contractions in 1907 and 1937, and the uncertain- 
ties caused by the declarations of war in 1917 and 1941. The increase 





13 Turnover rates for New York State savings deposits (which are heavily weighted by New York 
City institutions with higher than average deposit activity) are somewhat higher than those for all U.S. 
mutual savings banks, but they exhibit month-to-month fluctuations which are very similar to those 
of the U. S. series. About half of the total national deposits is accounted for by New York State. 
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in 1950-51, traceable to the outbreak of hostilities in Korea, was rela- 
tively minor. 


Cuarr II. Annual Rates of Turnover of Regular Deposits* in New York 
State Mutual Savings Banks and of Private Share Capital in all U. 8. Savings 
and Loan Associations. Monthly 1945-51. 
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* Regular deposiis including school savings accounts. 


Source: For savings and loan association data, Home Loan Bank Board. For mutual savings bank 
data, Savings Banks ‘Trust Co. 


Turnover raies for New York State savings banks suggest that an in- 
crease in the public’s desire to hold currency, resulting from an impair- 
ment of public confidence or from other causes," is reflected in with- 
drawals of savings deposits as well as of demand deposits. The velocity 
of savings deposits, however, rises very little or not at all in periods of 





4 For a detailed analysis based on investigations made by several New York savings banks, see 
Weldon Welfling, Savings Banking in New York State (Duke University Press, 1939), pp. 156 ff. Dr. 
Welfling remarks, however: “The stock market activity would not necessarily produce an increase in 
withdrawals and accounts closed, for an increase in loans on passbooks would serve the same purpose to 
the depositor.” 
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business expansion. Thus, the rate of turnover of deposits of New York 
savings banks did not increase in the late 1920’s until 1929, lagging 
behind the velocity of demand deposits at commercial banks outside 
New York City which rose nearly one-third during this period. Again, 
while velocity of demand deposits increased by one-third from the low 
point in 1949 to the peak in the spring of 1951, the rate of turnover of 
savings bank deposits (and of shares of savings and loan associations) 
rose very little, perhaps 10 per cent. As already noted, it declined dur- 
ing the expansion of business activity during World War II, when, 
however, the activity of demand deposits at weekly reporting banks 
also slowed down. 


Rates of Turnover of Deposits in New York State Savings Banks 








1900 26 1920 .33 1940 
1901 26 1921 32 1941 
1902 25 1922 31 1942 
1903 26 1923 .33 1943 
1904 26 1924 .32 1944 


1905 27 1925 .33 1945 
1906 29 1926 32 1946 
1907 31 1927 32 1947 
1908 28 1928 33 1948 
1909 25 1929 -40 1949 


1910 26 1930 32 1950 
1911 . 26 1931 34 1951 
1912 26 1932 32 
1913 26 1933 32 
1914 25 1934 26 


1915 24 1935 26 
1916 -22 1936 26 
1917 26 1937 27 
1918 26 1938 -25 
1919 .30 1939 24 





Source: Saving: Bank Trust Company. Chart Book. 


Similarly, the rate of turnover of savings deposits in the ABA sample 
of commercial banks declined during the period of expanding business 
activity, 1941-44, and again after the reconversion, 1947 through 1949." 





% The lack of responsiveness of the rates of turnover of savings deposits to an expansion in business 
activity is confirmed by the behavior of Postal Savings deposits. The annual rate of turnover of Postal 
Savings declined from 0.89 in 1921 to 0.68 in 1925 and 0.61 in 1928. Similarly, it dropped from 0.70 in 
1940 to 0.47 in 1945, 

The annual rate of turnover of such deposits has been two or three times higher than that for New 
York State savings deposits, except for the first years following the institution of the Postal Savings Sys- 
tem in 1911, when the rates were even higher because deposits were small. 
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Slight increases occurred in 1945 and 1950 and a rather sharp rise in the 
reconversion year 1946. 

The relative stability of the rate of turnover of New York State 
savings deposits over a period of more than fifty years is entirely com- 
patible with a declining trend in the rate of turnover of total time de- 
posits,'® which reflects the gradually diminishing importance of time 
deposits other than those evidenced by passbooks. There are two addi- 
tional reasons for presuming that, prior to the Banking Act of 1933, 
time deposits at commercial banks were more active than during the 
period covered by the Board’s study of weekly reporting member 
banks: 

1. In the 1920’s, it was possible to make withdrawals from savings 
accounts in some banks without personal presentation of passbooks. 
Opinions differ regarding the extent to which holders of time deposits 
considered them practically equivalent to demand deposits and were 
actually able to draw checks against them.!? There is a voluminous 
literature on the subject, and views range from Schumpeter’s assertion 
that “in the twenties time and demand deposits were essentially the 
same kind of thing”!* to Currie’s reluctance to consider time deposits 
essentially different from holdings of Government securities.’® 

It is uncertain whether the possibility of facile drawing against time 
deposit accounts was actually of material effect on the rate of with- 
drawals. On the other hand, after 1933, the prohibition of the payment 
of interest on demand deposits and the introduction of service charges 
(and their subsequent increases) may have induced some small deposi- 
tors to make more active use of their savings accounts to cash checks 
and to purchase cashier’s checks for making payments rather than to 
maintain separate but onerous checking accounts. 

2. It is also frequently claimed that, before 1933, part of the time 
deposits in commercial banks actually represented misclassified de- 
mand deposits.?° There are no direct quantitative data, and none are 
likely to become available, to resolve the conflicting views on the mean- 
ing of the growth of time deposits at commercial banks during the 





6 From the end of 1934 to 1939, when the turnover rates of the more homogeneous New York 
savings deposits fluctuated around a horizontal level, the velocity of time deposits at weekly reporting 
member banks outside New York City clearly showed a downward drift (Chart I). 

17 See Federal Reserve System Report of the Committee on Member Bank Reserves (Washington, 
1931), p. 7. 7 

18 J, A. Schumpeter, Business Cycles (New York, 1939), Vol. II, p. 856. 

19 Currie, op. cit., p. 14. 

20 See, for instance, B. M. Anderson’s comments in The Chase Economic Bulletin (Nov. 8, 1926), 
p. 14 n. As already mentioned, this view was also held very strongly by Schumpeter. Woodlief Thomas 
wrote: “In the 1920's, they [time deposits] could be easily withdrawn from many banks, and sometimes 
included funds that might otherwise have been in du:nand deposits.” (Banking Studies, p. 302). 
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twenties. The statistical evidence (supplementing personal knowledge 
of the existing practices) used by those who championed the opposing 
views, seems to have been limited essentially to data on the rate of 
growth of time deposits in different classes of banks and on the average 
size of time deposit accounts. The growth of time deposits, rather than 
any direct evidence of the classification of active deposits in the time 
category, gave rise to the views held by Anderson and others that dur- 
ing the twenties part of time deposits actually represented demand 
obligations.” 

It is impossible to arrive at a firm conclusion as to the extent to which 
bankers might have misclassified demand deposits or given special 
drawing privileges to owners of time deposit accounts. In any case, in 
recent years more rigid standards applied by bank examiners must have 
made the use of time deposits for current transaction purposes ex- 
tremely difficult (even though increased reserve requirements for de- 
mand deposits have made misclassification of deposits more attractive 
to bankers). It seems reasonable to assume that the velocity of time 
deposits has been declining over time primarily because of the decreas- 
ing influence of the more active time certificates of deposit and open 
accounts. Since these types of accounts are now reduced to insignifi- 
cance except in large money centers, it is likely that the velocity of 
time deposits will exhibit a stability similar to that of savings bank 
deposits. 


INTERPRETATION OF THE VELOCITY OF TIME DEPOSITS 


The concept of velocity of deposits has a precise analytical meaning 
only if withdrawals can be equated with use for transactions purposes. 
It is, however, not any more feasible to distinguish statistically in the 
case of time than in the case of demand deposits between payments for 
goods, services, or the acquisition of assets and the mere transfer of 
balances between different depositaries or accounts of the same holder. 

Such fragmentary data as are available seem to indicate that ac- 
tivity of time and savings accounts represents transfers of balances 





21 Professor French, who has carefully examined the statistical data for national banks for 1922-28 
concluded that “the bank statistics show clearly that there could have been no appreciable amount of 
shifting of deposits from the demand to the time category on the part of large depositors in national 
banks. If there was any such shifting, it was on the part of small depositors. We may safely say that the 
same conclusion would be valid for other commercial banks.” See D. R. French, “The Significance of 
Time Deposits in the Expansion of Bank Credit, 1922-28,” Journal of Political Economy, Dec. 1931. 
French's statistical analysis was confirmed by his interviews with executives of California corporations 
holding time deposits. 
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between savings institutions,” shifts among alternative forms of in- 
vestment or accounting entrances (such as expiration and renewal of 
eertificates of deposit) at least as much as it represents payments for 
goods and services.” 

To clarify the significance of the turnover rate of savings deposits, 
two aspects of account activity are of particular interest: (1) the sea- 
sonal pattern of turnover rates (which is nearly identical with that 
of withdrawals, since savings balances are very stable) and (2) the 
close month-to-month correspondence between the amount of new de- 
posits and withdrawals. 

Most New York savings banks pay interest at the end of June and 
December, while others credit interest quarterly. The seasonal pattern 
of turnover of deposits in New York savings banks (based on 1928-45) 
shows four peaks following these quarterly interest dates.** The highest 
peak occurs in January when rates of turnover are about 50 per cent 
higher than in the two immediately preceding and two following 
months; the July peak is somewhat lower, while the April and October 
activity is only about 30 per cent higher than in the eight months of 
slow activity.” 

This pattern, which reflects that of withdrawals from savings ac- 
counts, may be explained largely in terms of interest payment dates.” 
Since the general practice is to credit interest only for amounts remain- 





2 This situation is indeed paralleled when demand deposits are shifted among several accounts 
(at the same or at different institutions) of the same individual or firm. Since, however, the transaction 
velocity of demand deposits is so much higher, the proportion of transfers to total transactions is likely 
to be considerably lower than in the case of time deposits. 

23 See Weldon Welfling, “Some Characteristics of Savings Deposits,” American Economic Review, 
December 1940, pp. 748-58. 

* Quarterly data on withdrawals collected by the National Bureau of Economic Research (for one 
New York savings bank for 1880-1923 and for five such banks for 1900-1947) consistently show peaks 
in the first and third quarters, corresponding to the January and July peaks of the more inclusive but 
shorter series used here. 

% The available data for time deposits of weekly reporting member banks outside New York City 
show a broadly similar seasonal pattern, but the April and October peaks are much less pronounced than 
those in January and July. 

The seasonal pattern of withdrawals from savings and loan associations (based on 1944-51 data 
for the entire United States) shows two sharp peaks, in January and July (which are even sharper than 
for New York mutual savings banks) reflecting payment of dividends which in some cases is made auto- 
matically by check. The rates of turnover of the private share capital of U. 8. savings and loan associa- 
tions decline gradually during each of the five months following the two months of peak withdrawals (if 
the small dip in February is removed by placing the index on the daily average base). 

In contrast, December is the peak month for rates of turnover of demand deposits at the weekly 
reporting member banks, although the predominance of December has been declining in the moving 
seasonal factors computed by the Federal Reserve Bank of New York. November is the second highest 
month of demand deposit velocity. January is an average (or slightly above average) month, while July 
has been consistently below average, reflecting the slowing down of business activity during the vaca- 
tion period. High rates of turnover of demand deposits in the two last months of the year reflect Christ- 
mas business, the movement of crops, and a multiplicity of business and personal end-of-year payments. 

% Tax payment dates and seasonal need for extra cash (for payment of Christmas charge accounts 
in January and for vacation money in July) might be additional influences. 
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ing on deposit during the entire interest period, depositors who do not 
need funds urgently usually prefer to postpone withdrawals until the 
end of the interest period in order to earn the full interest. In particular 
investors who transfer their accounts from one institution to another 
would do it normally after interest for the last perod has been credited. 
Individuals, such as retired people, who use the income from savings 
accounts (and the principal as well) to meet current expenditures, are 
likely to make withdrawals after the (semiannual or quarterly) interest 
payment dates. Finally, since there is in New York State an upper 
limit (prior to 1951, 7,500 dollars) on individual savings bank accounts, 
interest on deposits which are at the maximum is normally withdrawn 
because no interest is paid on the excess amounts. Although the propor- 
tion of the accounts which are at the legal limit is small, they account 
for a relatively large share of total deposits. 

While interest payment dates provide a sufficient explanation for the 
peaks in withdrawals (and in the rate of turnover) we must seek other 
reasons for the very close correspondence between monthly withdrawals 
and new deposits: Indeed, as Chart III shows, high monthly withdraw- 
als are usually associated with high levels of new deposits and low 
withdrawals with low deposits. 

There are several possible explanations for such a close correlation. 
A transfer of funds from one savings bank to another is reported both 
as a withdrawal and a deposit. A more important explanation seems to 
be, however, that a large proportion of account activity represents use 
of savings accounts for the purpose of cashing occasional checks; many 
people deposit checks to their savings accounts and withdraw part or 
the entire amount in cash.’ 

Thus it is likely that, in addition to interest payments, the January 
peak in account activity reflects cashing of dividend, bonus, and Christ- 
mas checks. In other quarterly months, similar deposits of dividend 
and other quarterly checks are made and a large proportion of the pro- 
ceeds is withdrawn in cash. 

The conclusion seems justified that velocity rates for savings deposits 





27 There are among savings accounts a number of small but reiatively active accounts which are 
apparently maintained primarily to cash checks rather than to build up a backlog of savings. An analy- 
sis of a sample of widely scattered savings departments of commercial banks and mutual savings banks, 
revealed that in 1937-40 most accounts which averaged more than one withdrawal a month accounted 
for less than 5 per cent, and in only exceptional cases for as much as 10 or more per cert, of total de- 
posit liability. Such accounts were responsible, however, for from 10 to 50 per cent of the number of 
withdrawals in individual institutions. 

A much smaller sample showed that in 1943-44 the proportion of active accounts declined, but that 
such accounts were considerably more active than during the years immediately preceding World War 
II. Most likely, these changes reflect temporary wartime developments (see Savings Banks Trust Com- 
pany, Chart Book, Chart A-26). 
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are not necessarily indicative of the rate at which balances are used for 
transaction purposes. Part of the activity of savings deposit accounts 
reflects shifts of investments among various media, including transfers 
from one bank to another. Rates of savings deposit turnover reflect the 
use of balances, but also the use of the product of such balances (inter- 
est) and—perhaps even more—the utilization of check cashing facilities 
which these balances provide.?® 

Inter-city difierences in rates of time deposit turnover also reflect 
differences in the relative importance and activity of time deposits of 
business firms and of government units and in accounting practices. 
All this, together with the existence of a strong seasonal pattern, makes 
the interpretation of changes in the rates of time deposit turnover 
rather difficult. 

The main conclusions of this paper may be summarized as follows: 

1. Savings deposits at commercial banks turn over about once every 
two years, or only about one-fiftieth as rapidly as demand deposits. 

2. Because time deposits (other than interbank and U. S. Govern- 
ment) at commercial banks include several types of time accounts in 
addition to savings deposits, their velocity is higher than that of savings 
deposits alone, but still less than one. (The relatively higher turnover 
of time deposits in New York City member banks compared with 
“outside” banks reflects a higher percentage of time deposits other 
than savings deposits.) 

3. There has been a downward drift of the rate of time deposit turn- 
over in commercial banks outside New York City reflecting the relative 
increase of savings deposits in total time deposits of these institutions. 
The relative proportion of savings deposits in reported time deposits 
has been increasing continuously since the establishment of the Federal 
Reserve System, thus narrowing the difference between rates of turn- 
over of commercial bank time deposits and savings deposits. Currently, 
more than 94 per cent of time deposits at commercial banks outside 
New York City are savings deposits. It is, therefore, likely that changes 
in the velocity of savings bank accounts approximate closely those of 
savings and also time deposits at commercial banks outside New York 
City. 

4. Deposits at savings banks and shares of savings and loan associa- 
tions turn over about half as rapidly as savings deposits at commercial 





28 A survey made of several savings institutions during the last war indicated that checks cashed or 
partly cashed (a portion of the cash being deposited) were between } and } the number of regular deposit 
and draft transactions. (See Savings Banks Trust Company, Chart Book, Chart A-26.) In some cases, 
checks are chased in full as a courtesy, without requiring them to be deposited. 
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banks. This difference in level of velocity (changes being similar) is 
attributable to institutional factors. 

5. The rate of turnover of savings deposits is more stable than that 
of demand deposits. This stability reflects partly the failure of savings 
deposits to become more active when business expands. 

6. The difference in the nature and use made of savings and demand 
deposits is reflected in significant differences in the seasonal patterns of 
their velocities. 

7. A relatively small proportion of savings accounts with nominal 
balances are used for cashing salary checks and for the temporary safe- 
keeping of funds causing them to influence disproportionately the ac- 
tivity of total deposits of a savings institution. 

If these conclusions based on fragmentary and heterogeneous data 
surveyed have general validity, it would appear that those who are re- 
luctant to include time deposits in the money supply can find in them 
substantial arguments for their position. Indeed, if distinction between 
money and other types of liquid assets is approached not as a logical but 
as an empirical problem, it is apparent that average turnover rates of 
between 20 and 50 times a year are significantly different from those of 
once every two or four years. A significant part of the “activity” of 
savings accounts does not reflect withdrawals for making payments, 
but arises from the closing of accounts, from the cashing of salary and 
other checks (which are ultimately debited against demand deposit ac- 
counts at commercial banks thus affecting velocity data based on such 
debits), and from the transfer of funds (including interest received) to 
other savings institutions. Analysis of the available statistical data on 
the rates of turnover of savings and time deposits shows, furthermore, 
that these rates do not exhibit the same cyclical conformity or the same 
seasonal patterns as the rates of turnover of demand deposits. 

Considerably more justification could be found for including time 
deposits other than savings deposits with the money supply, although 
for recent years the amounts involved are very small in relation to de- 
mand deposits. 








CHANGES IN THE FUNCTIONAL DISTRIBUTION 
OF INCOME 


JESSE BURKHEAD* 
Syracuse University 


HE increased quantity and quality of national income statistics 

which have become available within the past few years make possi- 
ble a re-examination of significant relationships within the U. 8S. econ- 
omy. The present study represents an examination of one such relation- 
ship—labor and property shares in the national income. For this pur- 
pose the detailed accounts prepared by the National Income Division 
of the Department of Commerce have been utilized for the years 1929- 
1950. The conclusions for this period are then compared with the 
Kuznets data for the years 1919-1929. 

The present study is intended to serve two purposes: 

First, it will examine the common generalization that wages and 
salaries are a stable proportion of national income.! This generalization 
turns out to be valid only in relation to a particular set of national in- 
come concepts. When other concepts are employed it appears that 
labor shares in national income are not stable, but shift significantly 
from year to year and over a period of years. Section I will be devoted 
to an examination of alternative national income concepts which will 
differ substantially from those employed by the Department of Com- 
merce. Data on relative shares (changes in the distribution of income by 
type) will be presented and examined in Section II and in Section III. 

Second, this study will investigate changes in labor and property 
shares in the last three decades, but with particular reference to the 
decade of the Forties. That is, attention will be centered on the be- 
havior of labor and property shares under inflationary conditions. The 
data show a general pattern: increases in economic activity are associ- 
ated with a reduction in labor shares of total income; decreases in eco- 
nomic activity are associated with an increase in labor shares of total 
income; inflationary conditions appear to accentuate the decliiie in 
labor shares which is associated with increases in activity. The rationale 
for this behavior pattern will be considered in Section IV.? 





* The author is indebted to Joseph A. Pechman of the U. S. Treasury Department for criticisms of 
an earlier draft of this article. 

1 See, for example, Paul A. Sa Ison, Ec ics (McGraw-Hill Book Co., 1951), p. 231. 

2 In the literature, the problem of inflation has generally been treated in aggregative terms, as an 
“excess of demand over supply,” in terms of the rate of inflation and its measurement as a departure 
from assumed stability conditions. See, for example, Arthur Smitnies, “The behavior of money national 
income under inflationary conditions,” Quarterly Journal of Economics, November 1942, 113-138; 
Tjalling Koopmans, “The dynamics of inflation,” Review of Economics and Statistics, May 1942, 53-65. 
See also the literature on the inflationary gap, such as Walter A. Salant, “The inflationary gap,” Ameri- 
can Economic Review, June 1942, 308-313; Milton Friedman, “Discussion of the inflationary gap,” 
tbid., 314-20. 
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Before examining the data in detail it will be necessary to set forth 
the conceptual basis for the measurement of labor and property shares 
in total income. 

I 

National income concepts have no independent validity; they must 
be appropriate to the purposes at hand.* The present purpose is the 
measurement of the relative magnitude of labor income and of property 
income over a period of time. Two basic concepts (and some variants 
thereof) will be employed. The first basic concept will be labeled the 
“primary” distribution of income. The second, the “claims” concept of 
income. 

The primary distribution of income will measure labor and property 
shares derived from current economic activity. Therefore, income 
equals earnings paid or accrued for the use of property or personal serv- 
ices. This concept is similar to “earning power” measurements of in- 
come; and to the concept of producers’ income.® 

The second basic concept—income claims—attempts to measure 
labor and property shares in terms of relative economic power. Claims 
on goods and services reflect not only earnings and accruals from cur- 
rent ou*»ut but also earnings from the purchase and sale of assets (net 
capital ~ains) and from income receipts which are not derived from 
current production. The claims concept of income is, therefore, equal 
to the primary distribution of income plus net capital gains plus trans- 
fer payments. This brings the concept closer to Haig’s definition of in- 
come as “the money value of the net accretion to one’s econemic power 
between two points of time.”® 





Much less attention has been paid to structural problems—to the changes within the economy 
during inflationary periods. In one of the few contributions to this subject Holzman points out that only 
two of the ten economists represented in the symposium on inflation in the August, 1949 Review of Eco- 
nomics and Statistics mentioned distributional aspects of inflation (See Franklyn D. Holzman, “Income 
determination in open inflation,” Review of Economics and Statistics, May 1950, 150-58). Another con- 
tribution to structural relationships is Ralph Turvey, “Period analysis and inflation,” Economica, 
August 1949, 218-27. 

3 Simon Kuznets, “National Income,” Encyclopedia of the Social Sciences (New York, 1934), Vol. 
XI, p. 206. 

4 For a discussion of the “earning power” concept of income see, Concentration and Composition of 
Individual Incomes, 1918-1937, TNEC Monograph No. 4, Washington, 1940 (by Adolph J. Goldenthal), 
pp. 9-11. 

5 Tibor Barna, Redistribution of Incomes Through Public Finance in 1937 (Oxford, 1945), p. 17. 

*R. M. Haig, The Federal Income Tax (Columbia University Press, 1921), p. 7. (Cited by Roy 
Blough and W. W. Hewett, “Capital gains in income theory and taxation policy,” Studies in Income 
and Wealth, Vol. II (National Bureau of Economic Research, 1938), p. 197.) Haig would define “net 
accretion” to include consumer expenditures during the period. A complete application of the “economic 
power” concept of income would require the inclusion of both realized ard unrealized net capital gains, 
but no data are available on the latter. Also, it would be necessary to adjust income receipts and accruals 
in accordance with personal taxes. No attempt has been made to deal with this adjustment on the data 
presented here. However, an effort along these lines, based on Department of Commerce income con- 
cepts, has been recently attempted. See Edward F. Denison, “Distribution of national income,” Survey 
of Current Business, June 1952, 22-3. 
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The remainder of this section will be devoted to examination of the 
conceptual basis of the primary distribution of income, and the be- 
havior of labor and property shares when measured on this basis. The 
claims concept of income will be examined in Section III. 


PRIMARY DISTRIBUTION OF INCOME 


The measurement of labor and property shares in accordance with 
the primary distribution of income is set forth in detail for the years 
1929-1950 in Appendix Table A. Three concepts of total income have 
been employed. Labor income is identical in each of these three. Con- 
cepts II and III differ from Concept I in their treatment of property 
income. 

Labor income (Concepts I, II, III) includes: 


1. The compensation of employees 


This embraces wages and salaries in the private, military and gov- 
ernment civilian sectors, and the supplements to wages and salaries, 
including empioyer contributions to social insurance and to private 
pension funds. The compensation of corporate officers is treated here as 
labor income, although a part of such compensation should probably 
be classified as property income. That is, corporation managers may be 
in a position to “divert” corporete net income from stockholders to 
managers. The amount of suck diversion is likely to be small in relation 
to the total of property income and will probably exert no perceptible 
influence on relative shares.’ 


2. Income of unincorporated enterprise less inventory valuation adjust- 
ment. 


It is exceedingly difficult to determine whether this type of income is 
a labor or a property share. Entrepreneurial income is a mixture of 
payment for personal services (labor income) and return from property. 
In some instances, as with the income of professionals, it may be pre- 
dominantly attributable to labor. In other instances, as in unincorpo- 
rated manufacturing enterprise, it may be predominantly a return to 
capital. Not only will the division of this type of income between labor 
and property vary from industry to industry, it will also vary within an 





7 For example, in 1939 the compensation of corporate officers was about 9 per cent of the total 
of wages and salaries in the corporate sector. In 1948 it was about 7.5 per cent (U. 8. Department of 
Commerce, National Income, 1951, p. 156). On the other hand, there has undoubtedly been a growth 
in pension plans, stock purchase plans and expense accounts available to executives. It is not possible 
to estimate the volume of this kind of income or determine whether it represents property income or 
labor income. j 
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industry. Furthermore, there is no reason to assume that the labor- 
property relationship within the total of income from unincorporated 
enterprise will remain constant from year to year. 

The problem of classification of income of unincorporated enterprise 
becomes, then, the logically impossible feat of separating the insepara- 
ble parts of a whole. However, the sector is too large to be ignored; in 
1950 it accounted for 15 per cent of the total of national income. Chart 
I shows the movements in the totals of property income, labor income 
and income from unincorporated enterprise for the years 1929-1950. 
No clear pattern of relationship is established by the fluctuations in the 
three series. During the depression and recovery years the income of 
unincorporated enterprise generally moved in the same direction and 
to the same degree as property income. But in the war years its move- 
ments were comparable to those of employee compensation. 


Cuart I. Compensation of Employees, Property Income, Income of Unincor- 
porated Enterprise, 1929-1950. 
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The industrial composition of income of unincorporated enterprise 
may provide a somewhat better basis for its classification as labor in- 
come or property income than its behavior pattern. Although the com- 
position of income of unincorporated enterprise varies from year to 
year, four categories generally comprise its greatest bulk: agriculture, 
retail trade, medical and other health services, and legal services. (In 
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1950 these four made up 72 per cent of the income of the sector.) Since 
these four categories, in turn, appear to be predominantly labor-type 
activities, it would appear most reasonable to classify the whole sector 
as labor income. This would appear to be a better procedure than at- 
tempting to estimate its changing labor and property segments year 
by year. 

The classification of income from unincorporated enterprise as labor 
income would therefore appear to be justified on the ground of its in- 
dustrial composition. The advantage of this classification is that it ob- 
literates the shifts between the self-employed and wage earners; that is, 
it eliminates changes in labor shares that would otherwise result from 
changes in the proportion of the labor force which is self-employed. The 
inescapable disadvantage is that labor income, defined to include the 
income of unincorporated enterprise, will contain an uncertain and 
variable component of property income. 

Property income, Concept I, includes: 


1. Rental income of persons 


Although this type of income reflects some element of personal serv- 
ices provided by landlords to tenants, no breakdown of this portion is 
possible and the receipt as a whole is predominantly property income. 


2. Corporate profits after tax, without allowance for inventory valuation 
adjustment. 


This measurement of property income in the corporate sector raises 
two questions: 

a. In the consideration of income shares derived from current pro- 
duction it is appropriate to measure corporate profits after tax rather 
than corporate profits before tax. Although corporate profits before tax 
are, in some sense, available for the payment of wage increases, once 
the wage bill is paid, corporate profits taxes are not available to either 
labor or property income recipients; the taxes may not be spent by the 
corporation or its stockholders. The measurement of corporate profits 
net of taxes has the further advantage that it puts all property income 
on a uniform basis with respect to taxation, that is, all property income 
is then measured net of business taxes but gross of personal taxes.° 





8 Kuznets seems to be inclined toward the view that, entrepreneurial income is predominantly labor 
income. See Simon Kuznets, National Income and Its Composition, 1919-1938, Vol. I (National Bureau 
of Economic Research, 1941), p. 82. 

® The controversy concerning whether corporate profits taxes are reflected in product prices is not 
germane to the approach here, where income is measured in terms of shares paid or accrued. 
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However, for illustrative purposes, a measurement of property income 
which includes corporate profits taxes is set forth in Concept II. The 
differences in behavior exhibited by the two concepts will be examined 
below. 

b. Inventory valuation adjustment is appropriately excluded from 
the measurement of property shares. The Department of Commerce 
adjustment is for the purpose of arriving at the value of goods and 
services produced within the accounting period. But profits or losses 
on inventory should be included in a measurement of payments or ac- 
cruals to property income recipients from current activity. Such profits 
and losses are as much property income, and as much dependent on 
current operations, as profits from the production and sale of goods and 
services within the accounting period. 


3. Net interest 
This receipt is clearly and unambiguously property income. 


4. Net government interest 


Although excluded from national income, and treated, in effect, as a 
transfer payment in the Department of Commerce national income ac- 
counts, government interest is included here in the total of property 
income. Viewed from the standpoint of the recipient of government 
interest, this type of receipt is as much property income as interest on 
industrial bonds. The recipient of interest is the owner of property; 
this property yields a return in the form of government interest pay- 
ments. The inclusion of government interest is, therefore, appropriate 
when income flows are viewed from the standpoint of the recipient. It 
is equally appropriate to exclude government interest when, as in the 
Department of Commerce concepts, income flows are viewed in relation 
to the aggregate of current output.!° 

The sum of items 1+ represent property income, Concept I. Table 1 
shows a reconciliation between the concept of national income em- 
ployed by the Department of Commerce and the income concepts em- 
ployed here for the measurement of the relative shares of labor and 
property income paid or accrued to labor and property recipients. 

Two additional concepts of property income have been developed in 





10 In accordance with Department of Commerce procedure, government interest is measured ex- 
clusive of intra-governmental transfer, with the assumption that this amount may be appropriately 
imputed to individuals; that is, government interest receipts by financial institutions correspond to 
imputed service charges. (See National Income, 1951. p. 93.) 
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examining relative shares in accordance with the primary distribution 
of income. In Concept II corporate profits taxes have been added. This 
concept is developed primarily for purposes of comparison, and in ac- 
cordance with the convention that corporate profits taxes should he 
reflected in the national income. 

In Concept III property income reflects the addition of capital con- 
sumption allowances (corporate profits taxes are excluded). This treat- 
ment has the advantage of establishing a parity between labor and 


TABLE I 


RECONCILIATION: NATIONAL INCOME (DEPARTMENT OF 
COMMERCE), THE PRIMARY DISTRIBUTION OF INCOME, 
THE CLAIMS CONCEPT OF INCOME, 1947 








(Millions 
of dollars) 





198 , 688 
Deduct: 
Inventory Valuation Adjustment, Corporate 
Inventory Valuation Adjustment, Unincorp....... 
Corporate Profits Taxes 
Add: Government Interest 


The Primary Distribution of Income: 
Concept I 198 ,430 
Add: Corporate Profits Taxes 11,940 


Concept II 210 ,370 


Concept I 198 ,430 
Add: Capital Consumption Allowances 14,845 


Concept III 213 ,275 


The Claims Concept of Income: 
Concept I (primary distribution 198 ,430 
Add: Net Capital Gains, Corporate 924 
Net Capital Gains, Individuals 4,450 
Transfer Payments 11 ,803 
Dedt'~t: Contributions for Social Insurance 7,268 


208 ,339 
Add: Capital Consumption Allowances 14,845 


Concept III 223 , 184 
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property shares, since labor income, of necessity, includes “labor con- 
sumption allowances,” that is, the cost of maintaining labor’s earning 
capacity. Capital consumption allowances, as estimated by the Depart- 
ment of Commerce, represent amounts which are generally available to 
the property sector for purposes of maintaining capital equipment. 
Whether or not these allowances are adequate to maintain the capital 
“intact” is irrelevant for present purposes. The important point is that 
capital consumption allowances are available for expenditure by the 
property sector. 
II 


The behavior of labor and property shares in total income, where 
total income is based on the primary distribution of income, is depicted 
in Chart II. The underlying data are shown in Appendix Table A. The 
three concepts which have been discussed are plotted in terms of rela- 
tives, that is, as labor’s share of total income for each year 1929-1950. 


Cuart ITI. Labor Shares in Total Income: the Primary Distribution 
of Income, 1929-1950. 
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FLUCTUATIONS IN THE PRIMARY DISTRIBUTION OF INCOME 


Labor shares, measured in accordance with Concept I, rise sharply 
as economic activity declines from 1929 to 1932. With the progress of 
recovery after 1933, labor shares decline at the outset and then remain 
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reasonably stable for the remainder of the decade at from 80 to 83 per 
cent of total income. This is a substantially higher share than in 1929 
when labor shares were 75 per cent of total income. Since labor-prop- 
erty shares in 1929 were comparable to labor-property shares in most 
of the years of the Twenties, it follows that labor’s share of total in- 
come in the late Thirties was substantially higher than in the Twenties, 

The war period exhibits a striking increase in the share of total in- 
come going to labor, in marked contrast with the generally stable re- 
lationship from 1933 to 1941. The war economy distributed a substan- 
tially higher proportion of total income to labor than in the late Thirties. 
In 1945 labor’s share of total income was 88 per cent. This was 7 per- 
centage points more than in 1941 and 13 percentage points more than 
in 1929. 

The postwar period, marked by a decline in real output and an in- 
crease in prices, is associated with a decline in labor’s share of total 
income. The downward movement carries labor shares almost but not 
quite back to the levels of 1933-1941. Interestingly enough, the 1949 
recession is marked by a slight increase in labor shares—consistent with 
the slight increase in the recession of 1938. Also there is a slight down- 
turn in labor shares in 1950, notable not so much for the magnitude of 
change, which is insignificant, but for the direction of change. From 
1941 to 1942 the mobilization program produced a sharp increase in 
labor’s share of total income. This seems not to have happened in the 
1950 mobilization. 

The behavior of labor shares in total income, where income includes 
corporate profits taxes, is shown in Concept IT, Chart II. The year to 
year movements are comparable to those of Concept I, where corporate 
profits taxes are excluded, except for the years 1939 to 1941 when corpo- 
rate profits taxes rose sharply. However, the inter-decade position of 
labor income is different when Concept II is employed. In World War 
II labor shares of total income (including corporate profits taxes) now 
appear to be no higher than in the late Thirties, and, in the post-war 
period, substantially lower than in the late Thirties. As has been argued 
above, it would not be appropriate to attach too much importance to 
conclusions based on a concept of income which includes corporate 
profits taxes, since these taxes should not properly be viewed as income 
paid or accrued from current activity." 





11 A recent study of wage shares in the national income of the United Kingdom shows a very dif- 
ferent behavior pattern than for the United States. The war years, for example, show low rather than 
high ratios of wage income to the total. (See E. H. Phelps Brown and P. E. Hart, “The Share of Wages 
in National Income,” Economic Journal, June 1952, 253-77, especially p. 265.) However, the income 
concepts employed by Messrs. Brown and Hart are not wholly comparable with those utilized here. 
Profits, for example, are measured gross of profits taxes. 
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In Concept III, Chart II, capital consumption allowances are in- 
cluded in property income, and a sharply different behavior pattern 
emerges than when capital consumption allowances are neglected. 
Property income shares are not only considerably larger but now show 
much greater stability through the years of most serious depression. 
In 1932 capital consumption allowances are larger than all other prop- 
erty income. 

The inclusion of capital consumption allowances further accentuates 
the relative improvement in labor’s share during World War II. The 
shrinkage in labor shares in the postwar period is similar in magnitude 
to that exhibited by Concept I. However, Concept III shows that 
labor’s share remains higher in the postwar period than in the Thirties. 
This is to be attributed to the fact that capitai consumption allowances 
are smaller in relation to other income in the postwar period than dur- 
ing the Thirties. 

Perhaps the most important conclusion which emerges from the data 
shown in Chart II is that, employing either Concept I or Concept ITI, 
labor shares in total income rose sharply in World War II, receded 
during the postwar period, but remained somewhat higher than in the 
late Thirties. The economy appears to have been distributing a larger 
portion of its total income to labor over the past twenty years, although 
the postwar inflation has offset a part of this tendency. 


PRIMARY DISTRIBUTION OF INCOME IN THE CORPORATE SECTOR 


Department of Commerce data permit isolation and separate exami- 
nation of labor and property shares in the corporate sector. The be- 
havior of labor shares in accordance with varying concepts of income is 
set forth in Chart III; the data are shown in Appendix Table B. In 
general, Concepts I, II, and III are similar in definition to those utilized 
in Chart IT. 

Labor income, for all three concepts, is uniformly taken to include the 
compensation of employees of corporate enterprise. This compensation 
includes wages and salaries, employer contribution for social insurance 
and for private pension funds, and “other labor income.” Property in- 
come in Concept I includes corporate profits after tax and net interest 
paid by corporations. For Concept II, property income also includes 
corporate profits taxes. For Concept III property income excludes 
corporate profits taxes, but includes corporate depreciation, for which 
data are available since 1933.” 





12 Corporate depreciation has been used because it is not possible to isolate capital consumption 
allowances for corporations in the Department of Commerce accounts. 
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As Chart III shows, fluctuations in labor shares in the corporate 
sector are considerably sharper than the fluctuations in labor shares in 
the economy as a whole. For example, the 1937-38 recession and the 
1949 recession are considerably more pronounced. And, of course, the 
differences in behavior between Concept I and Concept II (corporate 
profits taxes included) are considerably sharper than for the economy 


Cuart III. Labor Shares in Total Income, Corporate Sector, the 
Primary Distribution of Income, 1929-1950. 
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as a whole. The inclusion of corporate depreciation (Concept III) seems 
to have an effect similar to the inclusion of consumption allowances for 
the economy as a whole. 

The conclusions from Chart III are similar to those which may be 
derived from Chart II. Labor shares in the corporate sector were higher 
in the late Thirties than in 1929, and higher in World War II than in 
the late Thirties. 

Postwar changes are more difficult to interpret. Where income is 
based on Concept I it would appear that the shrinkage in labor shares 





It wouid be appropriate to include net rents in property income for the corporate sector-rents and 
royalties received by corporations less net rent paid on business property. Unfortunately, complete 
data on this type of income are not available for all years. However, in dollar amounts, and in its effect 
on relative shares, this omission is unimportant. For example, in 1942, net rents of corporations amounted 
to —$8 million. (U. 8. Treasury Department, Statistics of Income for 1942, Part 2, pp. 328-29.) 
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was sufficiently pronounced to bring this sector back to about the same 
relative position that it occupied in the semi-prosperous years of the 
Thirties. However, where corporate depreciation is included (Concept 
III) the conclusion is comparable with that observed for the economy 
as a whole. The shrinkage in labor’s relative position in the postwar 
years still left this sector in a slightly improved status as compared 
with the Thirties. 


III 


The measurements of labor and property shares in total income, as 
set forth in Part II, are based on the primary distribution of income, 
defined as payments or accruals from current economic activity. It is 
also useful to explore the behavior of relative shares when income is 
measured on the basis of claims. This requires attention to the appropri- 
ate classification of net capital gains and of transfer payments. 


RELATIVE SHARES ON THE BASIS OF CLAIMS 


Capital gains are rather obviously property income and have been 
classified as such here. Data on realized capital gains are taken from 
the Seltzer study. Capital gains have been included in property in- 
come at net; that is, losses have been deducted from gains, and where, 
as in depression years, there are aggregate net losses, these are added 
algebraically to property income. Net gains realized by both individuals 
and corporations are included. 

Transfer payments are classified as labor j income. Again, it should 
be recalled that income is viewed here from the standpoint of the re- 
cipients. There can be little doubt that the recipients of labor income 
are, by and large, the recipients of transfer payments. In 1950, for 
example, $6,125 million of the $15,082 million of transfer payments 
represent benefits from social insurance funds. Military and veterans 
pensions and benefits, generally related to services previously per- 
formed, amounted to an additional $4,284 millions. 

In combining transfer payments with labor income as defined in ac- 
cordance with the primary distribution of income (Part II), it is neces- 
sary to deduct employer and employee contributions for social insur- 
ance and employer contributions for private pension funds from the 
total of labor income. This will avoid counting, as labor income, both 
employer and employee contributions to social insurance and pension 
funds and employee benefits from social insurance and pension funds. 





3 Lawrence H. Seltzer, The Nature and Tax Treatment of Capital Gains and Losses (National Bureau 
of Economic Research, 1951), pp. 367, 531. 











204 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1953 


This deduction has been made in Chart IV, and in Appendix Table C, 
which sets forth the data on which Chart IV is based." 

Chart IV shows labor’s share in total income in accordance with the 
claims concept of income; transfer payments are included in labor in- 
come and net capital gains in property income. Concept III differs 
from Concept I in that capital consumption allowances are included in 
the property income segment. 


Cuart IV. Labor Shares in Total Income: the Claims Concept of 
Income, 1929-1950. 
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A comparison of Concept I, Chart II, with Concept I, Chart IV 
shows that the measurement of income on a claims basis produces re- 
sults which are markedly different than the measurement of income on 
the basis of primary distribution. First, property income falls more 
sharply in the depression because of the impact of capital losses. Sec- 
ond, the wartime improvement in labor’s relative position is still sub- 
stantial, when compared with the Thirties, but is by no means as strik- 
ing as when net capital gains and transfer payments are omitted. 
Third, labor’s relative position in the postwar period appears to be al- 
most identical with its relative position in the late Thirties. This is a 
reflection of the fact that transfer payments are less significant in rais- 





4 Employee benefits from social insurance funds are included in transfer payments, but benefits 
from Lrivate pension funds are not included, and cannot be isolated in the national income accounts. 
The procedure which has been adopted here will, therefore, tend to understate labor income to the ex- 
tent of current benefits from private pension funds. 
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ing the labor share of total income in the postwar period share than are 
‘ capital gains in raising the pruperty share. 

When capital consumption allowances are included (Concept III), it 
will be observed that the effect of the net gains and transfer payments is 
somewhat diluted. The behavior of Concept III in Chart IV is very 
similar to the behavior of Concept III in Chart II. The depression 
shrinkage in property incomes occasioned by capital losses is, of course, 
reflected in Chart IV, but the wartime improvement in labor’s relative 
share appears to be less striking when net gains and transfer payments 
are included. For both the primary distribution of income and the 
claims concept of income, the inclusion of capital consumption allow- 
ances in property income tends to accentuate the improvement in 
labor’s relative position over the whole period. 

In conclusion, it would appear that the major differences between 
the measurement of labor and property shares in terms of the primary 
distribution of income and in terms of the claims concept of income 
are: 1) the claims concept of income shows labor’s relative improve- 
ment to be more modest during World War II than does the primary 
distribution of income, and, 2) the claims concept shows labor’s postwar 
relative position to be almost identical with its relative position in the 
late Thirties, while the primary distribution shows labor’s relative po- 
sition to be slightly improved in the postwar period. 


LABOR AND PROPERTY SHARES IN THE TWENTIES 


Data for the decade of the Twenties, comparable in refinement to the 
Department of Commerce accounts for 1929-50, are not available. 
However, the Kuznets estimates provide an adequate basis for broad 
comparison. 

Chart V, for the years 1919-1929, is based on Kuznets data, with 
concepts generally similar to those described above in terms of the 
primary distribution of income." As the chart indicates, from 1921 to 
1929 there was remarkable stability in the labor-property division of 
total income. 

Data are not available for the estimation of labor and property 
shares in the Twenties in accordance with the claims concept of in- 
come; the necessary information on transfer payments is lacking. How- 





4 For a detailed discussion of these concepts see Simon Kuznets, National Income and Its Composi- 
tion, 1919-1988, Vol. I (National Bureau of Economic Research, 1941), pp. 3-60, 215-65. 

The concept of “total income” as used in Appendix Table D and Chart V differs from Kuznets’ 
national income by the exclusion of government net savings. On the other hand, “total income” as used 
here differs from the primary distribution of income, as set forth in Chart I, principally in the exclusion 
of imputed rent on owned homes and food produced and consumed on farms. 
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ever, the Seltzer data on capital gains for individuals for these years 
would indicate that the stability exhibited in the labor-property rela- 
tionship would be substantially modified by the addition of net gains 
to the property segment.'* For example, in the years 1919-1923 net 
capital gains for individuals aggregate to an insignificant $30 million, 
But for the years 1924-1929 they aggregate to $15.8 billion. Total 
property income for these years amounted to %96.9 billion. For the 
single year 1928, in which net gains were most sig» ificant, their inclu- 


Cuart V. Labor Shares in Total Income, 1919-1929. 
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sion in property income would raise the relative share of property in- 
come in the total by about 20 per cent (four percentage points). 

If it may be assumed that transfer payments during the years 1919- 
1929 were minimal, it is evident that a claims concept of income for the 
Twenties, in contrast with a primary distribution concept, would show 
a downward drift in labor shares beginning in the year 1924, and con- 
tinuing through 1929. 


IV 


Before summarizing the general findings which emerge from this in- 
vestigation it may be useful to explore briefly the relationship between 





16 See Seltzer, op. cit., p. 367. Net gains of corporations are not available. 
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changes in labor and property shares of total income and resulting, or 
corresponding, changes in the distribution of personal income. 


RELATIVE SHARES AND THE DISTRIBUTION OF INCOME 


It is not possible to move directly from the measurement of distribu- 
tion of income by type to the distribution of income by size class be- 
cause Many persons receive income from several sources. Nevertheless, 
the total income of upper income groups is composed more largely of 
property income than is the total income of lower income groups, and 
this fact establishes a rough linkage between changes in labor and prop- 
erty shares and changes in concentration in the distribution of income. 
On this basis it could be expected that an increase in property shares 
in total income would tend to increase concentration in the distribution 
of income. It could be expected that an increase in labor shares of total 
income would tend to reduce concentration in the distribution of in- 
come. 

However, as Pechman has pointed out, a decrease in the ratio of. 
wages to total income can be produced in two ways: first, since wages 
are a smaller proportion of income at higher income levels, the ratio of 
wages to total income might decrease as income recipients are shifted 
upward in the distribution; second, an increased concentration of in- 
come might reduce the share of total income going to wages.'” Therefore, 
only when both total income and labor’s share are concurrently in- 
creasing can it be certain that concentration is reduced; similarly, only 
when both total income and labor’s share are falling can it be certain 
that concentration is increased. 

It follows that, strictly speaking, only the World War II case shows 
an irrefutable instance in which the concentration of income is reduced. 
In all other instances in the years 1919-1950 where labor shares rise or 
fall, total income is moving in the opposite direction. 

Nevertheless, the more generalized and less rigid hypothesis that an 
increase in labor shares is prima facie indication of reduced concentra- 
tion seems to be generally borne out by the conclusions that have been 
reached on the basis of other methods of analysis. On the basis of his 
hypothesis it would be concluded that, for both the primary distribu- 
tion of income and the claims concept of income, the distribution of 
income was less concentrated in the Thirties than in the Twenties and 
less concentrated during World War II than in the Thirties. Concen- 
tration increased in the 1946-1950 period as compared with World War 





17 Frank A. Hanna, Joseph A. Pechman, Sidney M. Lerner, Analysis of Wisconsin Income (Na- 
tional Bureau of Economic Research, 1948), pp. 69-75. 
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II, but the resulting degree of concentration was slightly less than in 
the late Thirties. 

The Kuznets study of upper income group shares indicates that con- 
centration decreased from 1939 to 1945 and increased somewhat after 
1945.18 The finding here that income distribution was less concentrated 
in World War II than during the middle Thirties is also in line with 
conclusions drawn from the surveys of family income for the years 
1935-36, 1941, 1942 and 1945.9 

On the other hand, the conclusion indicated in Chart II is that the 
1946-1948 inflation tended to increase property shares and, therefore, 
in all probability, tended to increase concentration in the personal dis- 
tribution of income. The survey of consumer finances conducted by the 
Board of Governors of the Federal Reserve System indicates that con- 
centration increased from 1946 to 1947, but was reduced in the three 
succeeding years.”° This is in conflict with the Chart II data, which 
shows concentration increasing continuously from 1946 through 1948, 
with a very modest reduction in concentration in 1949, and an increase 
again in 1950. There appears to be no way to reconcile these conflicting 
conclusions. 

The finding in Chart II that labor shares tend to move against the 
business cycle, that is, increase during declines in economic activity and 
decrease as economic activity increases, would tend to indicate that the 
primary distribution of income is more concentrated during expansions 
than during contractions. The Kuznets data on concentration as re- 
vealed by changes in the shares of the upper one per cent of income 
recipients (where property income dominates) would appear to support 
this conclusion.”! On the other hand, Mendershausen found that in 1933 
income was somewhat more concentrated than in 1929.% However, 
these findings were based on examination of the top 30 per cent of in- 
come recipients. Mendershausen found that the top one per cent, where 
property income is wholly predominant, showed a reduction in concen- 
tration between 1929 and 1933. 

It may be tentatively concluded that an increase in labor shares is 





18 Simon Kuznets, Shares of Upper Income Groups in Income and Savings (National Bureau of Eco- 
nomic Research, 1950), pp. 3-4. 

19 Morris A. Copeland, “The social and economic determinants of the distribution of income in the 
United States,” American Economic Review, March 1947, 57. Also, for a comparison of 1929 and 1937 
see Julius Wyler, “The share of capital in national income,” Social Research, November 1943, 436-54. 

20 Federal Reserve Bulletin, August 1951, pp. 929. 

21 Shares of Upper Income Groups in Income and Savings, p. 33 (Panel A). 

22 Horst Mendershausen, Changes in Income Distribution During the Great Depression (National 
Bureau of Economic Research, 1946), pp. 68-80. 
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generally associated with an increase in equality in the distribution of 
income. However, the evidence does not suggest that this is an exact 
relationship which holds for moderate year to year changes in labor and 
property shares. 


THE GENERALIZED BEHAVIOR OF LABOR AND PROPERTY SHARES 


The relationships shown in Charts II, III and IV reveal a general 
pattern of behavior. As the level of economic activity increases, labor 
shares tend to decline. As the level of economic activity decreases, 
labor shares tend to increase. The only marked exception to this pat- 
tern is the case of World War II, when, as economic activity increased, 
labor shares also increased. 

It would appear that there are a number of factors which ezontribute 
to this inverse relationship between labor shares and changes in eco- 
nomic activity. 

As economic activity increases, property incomes can be expected to 
gain in relation to labor incomes by what may be called the capacity 
effect. As the output of firms increases beyond the break-even point, 
there is a concomitant increase in the volume of profits. The traditional 
spreading of overhead, with correspondingly lower unit labor costs, 
produces a larger return to property per unit of output as output in- 
creases. The growth in the relative shares of property income recipients 
is certainly not without limit; this limit is associated with a dispropor- 
tionate increase in marginal costs.“ 

Where increases in economic activity are accompanied by inflationary 
conditions two additional factors will tend to increase the share of 
property income in relation to labor income. These may be termed the 
lag effect and the compounding effect. For present purposes the term in- 
flation will be used to describe a condition in which demand for final 
product is increasing with the prices of goods and services generally 
rising, but with some prices rising more rapidly than others. That is, 
inflation is both an aggregative and a differential phenomenon. 

The lag effect, which tends to increase property shares, is a corollary 
of the differential impact of price rises. A rising rate of effective de- 
mand confronts an output of goods and services which is increasing less 
rapidly than the increase in demand. The market for final product re- 
ceives the full brunt of the higher demand; the increased demand for 
labor is secondary. The seller of commodities becomes the immediate 
beneficiary of the rising prices. Entrepreneurial shares increase and the 








% For this point I am indebted to A. M. MclIsaac. 
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increase raises the shares of property income in the total.* The lag 
effect is not independent of the capacity effect. Increases in price for 
final product will raise the break-even point and broaden the range of 
profitable operation. 

Under inflationary conditions, a shift in relative shares of total in- 
come in favor of property income recipients will also result from what 
may be called the compounding effect. That which is compounded igs 
savings, and the compounding occurs through its reinvestment.”* Actu- 
ally, this effect operates independently of price rises during any period 
in which operating losses and capital losses of firms are at a minimum, 
However, such a situation is characteristic of an inflationary period. 
Further, as entrepreneurial income expands, relatively, in an inflation- 
ary period, the tendencies toward compounding will be reinforced, if it 
may be assumed that the marginal propensity to consume out of en- 
trepreneurial income is smaller than out of labor income. As long as 
inflationary conditions continue, with operating and capital losses at a 
minimum, those who have enjoyed the enhanced returns may further 
improve their economic position by earning income on income. The 
check on the compounding effect, is, of course, the appearance of oper- 
ating and capital losses. A reduced rate of return on investment will 
limit its operation but not offset it. 

The importance of each of these factors in the relative growth of 
property income under inflationary conditions will vary with the rate 
and duration of the inflation. Presumably, lag effect will be most pro- 
nounced when the rate of price increases is most rapid. Compounding 
effect, on the other hand, is likely to be of greatest importance when in- 
flationary conditions have been sustained for long periods of time. The 
capacity effect will depend on the level from which increases in eco- 
nomic activity proceed and will be most important when increases in 
output are initiated from a point where many firms are operating close 
to break-even. 

The operation of three factors—lag, compounding and capacity ef- 
fect—will alter labor and property shares in accordance with changes 
in the level of economic activity and the level of prices. Lag and ca- 
pacity effect, in particular, may be expected to be important in year to 
year fluctuations. 

These three factors are independent of inter-industry shifts which 





% The tendency of inflation to increase entrepreneurial shares has often been noted, but seldom 
explored in the literature. See Koopmans, op. cit.; and, of course, J. M. Keynes, How te Pay for the War 
(Harcourt, 1940), p. 6. 


% Gordon Hayes, Spending, Saving, and Employment (Knopf, 1945), pp. 35-42. 
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may alter labor and property shares in total income, particularly over a 
period of years. Most writers in the field have stressed the importance 
of these shifts in altering labor shares. In examining the upward move- 
ment in relative labor income between 1919 and 1938 Kuznets found 
that relative increases in employee compensation were traceable almost 
wholly to production shifts from industries in which the ratio of wages 
and salaries to property income was low to industries in which the ratio 
of wages and salaries to property income was high. Kuznets did not find 
a measurable tendency for particular industries to vary their ratio of 
wages and salaries to property income over this whole period.”* The 
Department of Commerce has also stressed the importance of inter- 
industry shifts in the explanation of the larger employee shares in 1950 
as compared with 1929.77 

Unfortunately, it is not possible to disentangle statistically the year 
to year effects of the three factors noted above from the effects of inter- 
industry shifts over a period of years on labor and property shares in 
total income. 

Finally, it should be noted that the general conclusion which emerges 
from the examination of labor and property shares in accordance with 
the concepts developed here is that they are by no means stable, either 
from year to year or over a period of years. This conclusion differs 
rather sharply from that reached by analysts in the Department of © 
Commerce who have stressed the stability in income shares, particu- 
larly in the employee compensation share of national income.”* The 
explanation for these differing conclusions lies in the concepts which 
are employed. The Department of Commerce analysis of employee 
compensation is based on a concept of income which includes corporate 
profits taxes and inventory valuation adjustment. Both of these are 
excluded from Concepts I and III as developed here; the latter Concept 
also includes capital gains and transfer payments. The importance of 
the conceptual basis in the measurement of relative shares is well illus- 
trated in the analysis of the corporate sector (Chart III and Appendix 
Table B). In 1929 the addition of corporate profits taxes to total income 
affects labor’s share (Concept I) by only 2.4 percentage points. In 1950 
this addition affects labor’s share by 11.1 percentage points. 


(Continued on page 218) 





2% Kuznets, National Income and Its Composition, pp. 241-50. 

27 Denison, “Distribution of National Income,” pp. 16-23. 

28 [bid.; Lawrence Groze, “Labor income in the postwar period,” Survey of Current Business, May 
1952, 7-13. , 
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TABLE A 
LABOR AND PROPERTY SHARES: THE PRIMARY DISTRIBUTION OF INCOME 


1929-1950 
(money figures in millions of dollars) 











1929 1930 1931 1932 1933 1934 1935 1936 





Property Sector: 
Be PRE OEE. oo vce cccsveves 5,811 4,786 3,620 2,508 2,018 2,095 2,288 2,682 
2. Corp. Profits After Tax......... 8,420 2,455 —1,283 —3,424 —362 977 = 2,259 Ss 4,273 
De I icncccckecssicacscs 6,541 6,176 5,938 5,430 5,010 4,750 4,539 4,474 
4, Govern’t Interest.............. 983 964 1,084 1,141 1,170 1,230 1,141 1,101 
5. Total Property (Concept I)..... 21,755 14,381 9,359 5,655 7,836 9,052 10,227 12,530 
Labor Sector: 
6. Income of Unincorp............ 13,927 10,963 8,214 4,921 5,207 6,603 9,858 9,942 
ee, eee 142 755 611 295 —525 —54 —50 —120 
8. Net Income of Unincorp........ 13,785 10,208 7,603 4,626 5,732 6,657 9,908 10,062 
9. Compensation of Employees..... 50,786 46,515 39,470 30,826 29,330 34,067 37,107 42,675 
10. Total Labor (Concept I)........ 64,571 56,723 47,073 35,452 35,062 40,724 47,015 52,737 
11. Total Property (Line 5)........ 21,755 14,381 9,359 5,655 7,836 9,052 10,227 12,530 
12. Total Income (Concept I)....... 86,326 71,104 56,432 1,107 42,898 49,776 57,242 65,267 
Ratios: Concept I 
13. Total Labor (Line 10).......... 74.8 79.8 83.4 86.2 81.7 81.8 82.1 80.8 
14. Total Property (Line 11)....... 25.2 20.2 16.6 13.8 18.3 18.2 17.9 19.2 
15. Total Property (Line 11)....... 21,755 14,381 9,359 5,655 7,836 9,052 10,227 12,530 
16. Add: Corp. Profits Taxes....... 1,398 848 500 382 524 746 965 1,411 
17. Total Property (Concept II).... 23,153 15,229 9,859 6,037 8,360 9,798 11,192 13,941 
18. Total Labor (Line 10).......... 64,571 56,723 47,073 35,452 35,062 40,724 47,015 52,737 
19. Total Income (Concept II)...... 87,724 71,952 56,932 41,489 43,422 50,522 58,207 66,678 
Ratios: Concept II 
20. Total Labor (Line 18).......... 73.6 78.8 82.7 85.4 80.7 80.6 80.8 79.0 
21. Total Property (Line 17)....... 26.4 21.2 17.3 14.6 19.3 19.4 19.2 21.0 
22. Total Property (Line 11)....... 21,755 14,381 9,359 5,655 7,836 9,052 10,227 12,530 
23. Add: Cap. Cons. Allowances.... 8,816 8,747 8,312 7,663 7,245 7,218 7,369 7,684 
24, Total Property (Concept III)... 30,571 23,128 17,671 13,318 15,081 16,270 17,596 20,214 
25. Total Labor (Line 10).......... 64,571 56,723 47,073 35,452 35,062 40,724 47,015 52,737 
26. Total Income (Concept ITI)..... 95,142 79,851 64,744 48,770 50,143 56,994 64,611 72,951 
Ratios: Concept III 
27. Total Labor (Line 25).......... 67.9 71.0 72.7 72.7 69.9 71.5 72.8 72.3 
28. Total Property Line 24)........ $2.1 29.0 27.3 27.3 30.1 28.5 27.2 27.7 








Source: Department of Commerce, National Income, 1951. 
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TABLE A (Continued) 
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1939 1940 1941 1942 1943 1944 1945 1946 1947 194 1949 1950 

3,465 3,620 4,322 5,395 6,109 6,495 6,256 6,620 7,059 7,506 7,545 8,039 
5,005 6,447 9,386 9,433 10,646 10,808 8,502 13,881 18,549 20,734 17,347 22,774 
4,212 4,104 4,113 3,894 3,355 3,137 3,009 2,922 3,544 4,335 4,908 5,386 
1,205 1,291 1,289 1,517 2,140 2,803 3,663 4,432 4,378 4,451 4,610 4,710 
13,887 15,462 19,110 20,239 22,250 23,243 21,430 27,855 33,530 37,026 34,410 40,909 
11,282 12,660 16,504 25,041 26,731 28,997 31,247 35,375 35,365 39,751 33,903 35,964 
—166 —52 —644 —372 —154 —70 -—113 —1,819 —1,547  —395 632 —1,552 
11,448 12,712 17,148 23,413 26,885 29,067 31,360 37,194 36,912 40,146 33,271 37,516 
47,820 51,786 64,280 84,895 109,212 121,163 123,026 117,098 127,988 140,166 139,887 153,333 
59,268 64,498 81,428 108,308 136,097 150,230 154,386 154,292 164,900 180,312 173,158 190,849 
13,887 15,462 19,110 20,239 22,250 23,243 21,430 27,855 33,530 37,026 34,410 40,909 
73,155 79,960 100,538 128,547 158,347 173,473 175,816 182,147 198,430 217,338 207,568 231,758 
81.0 80.7 81.0 84.3 85.9 86.6 87.8 84.7 83.1 83.0 83.4 82.3 
19.0 19.3 19.0 15.7 14.1 13.4 12.2 15.3 16.9 17.0 16.6 17.7 
13,887 15,462 19,110 20,239 22,250 23,243 21,430 27,855 33,530 37,026 34,410 40,909 
1,462 2,878 7,846 11,665 14,406 13,525 11,215 9,583 11,940 13,028 10,989 18,593 
15,349 18,340 26,956 31,904 36,656 36,768 32,645 37,438 45,470 50,054 45,399 59,502 
59,268 64,498 81,428 108,308 136,097 150,230 154,386 154,292 164,900 180,312 173,158 190,849 
74,617 82,838 108,384 140,212 172,753 186,998 187,031 191,730 210,370 230,366 218,557 250,351 
79.4 77.9 75.1 77.2 78.8 80.3 82.5 80.5 78.4 78.3 79.2 76.2 
20.6 22.1 24.9 22.8 21.2 19.7 17.5 19.5 21.6 21.7 20.8 23.8 
13,887 15,462 19,110 20,239 22,250 23,243 21,430 27,855 33,530 37,026 34,410 40,909 
8,101 8,440 9,294 9,981 10,680 11,887 12,410 12,163 14,845 17,612 19,058 21,177 
21,988 23,902 28,404 30,220 32,930 35,130 33,840 40,018 48,375 54,638 53,468 62,086 
59,268 64,498 81,428 108,308 136,097 150,230 154,386 154,292 164,900 180,312 173,158 190,849 
81,256 88,400 109,832 138,528 169,027 185,360 188,226 194,310 213,275 234,950 226,626 252,935 
72.9 73.0 74.1 78.2 80.5 81.0 82.0 79.4 77.3 76.7 76.4 75.5 
27.1 27.0 25.9 21.8 19.5 19.0 18.0 20.6 22.7 23.3 23.6 24.5 
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TABLE B 
LABOR AND PROPERTY SHARES IN THE CORPORATE SECTOR: THE PRIMARY DISTRIBUTION OF INCO 


1929-1950 
(money figures in millions of dollars) 











1929 1930 1931 1932 1933 1934 1935 1936 





Property Sector: 
1. Corp. Profits After Tax......... 8,188 2,318 —1,279 —3,390 —360 917 2,100 4,169 
re ee 1,617 1,742 1,802 1,759 1,715 1,707 1,626 1,557 
3. Total Property (Concept I)..... 9,805 4,060 523 —1,631 1,355 2,624 3,726 65,726 
4. Comp. of Employees........... 33,522 30,074 24,706 18,464 17,493 20,493 22,462 25,628 
5. Total Property (Line 3)........ 9,805 4,060 523+ —1,631 1,355 2,624 3,726 65,726 
6. Total Income (Concept I)....... 43,327 34,134 25,229 —_— 18,848 23,117 26,188 31,354 
Ratios: Concept I 
7. Total Labor (Line 4)........... 77.4 88.1 97.9 — 92.8 88.6 85.3 81.7 
8. Total Property (Line 5)........ 22.6 11.9 2.1 — 7.2 11.4 14.2 18.3 
9. Total Property (Line 5)........ 9,805 4,060 523 —1,631 1,355 2,624 3,726 5,726 
10. Add: Corp. Profits Taxes. ...... 1,398 848 500 382 524 746 965 1,411 
11. Total Property (Concept IT).... 11,2083 4,908 1,023 —1,249 1,879 3,370 4,691 7,137 
12. Total Labor (Line 4)........... 33,522 30,074 24,706 18,464 17,493 20,493 22,462 25,628 
13. Total Income (Concept II)...... 44,725 34,982 25,729 — 19,372 23,863 27,153 32,765 
Ratios: Concept II 
i4. Total Labor (Line 12).......... 75.0 86.0 %.0 — 90.3 85.9 82.7 78.2 
15. Total Property (Line 13)....... 25.0 14.0 40 — 9.7 14.1 17.3 21.8 
16. Total Property (Line 5)........ 1,355 2,624 3,726 5,726 
17. Add: Corp. Depreciation........ 3,498 3,366 3,357 3,287 
18. Total Property (Concept III). ... 4,853 5,990 7,083 9,013 
19. Total Labor (Line 4)........... 17,493 20,493 22,462 25,628 
20. Total Income (Concept II)...... 22,346 26,483 29,545 34,641 
Ratios: Concept III 
21. Total Labor (Line 19).......... 78.2 77.4 76.0 74.0 
22. Total Property (Line 18)....... 21.8 22.6 24.0 26.0 








Source: Department of Commerce, National Income, 1951. 








53 CHANGES IN THE FUNCTIONAL DISTRIBUTION OF INCOME 








OF INCO: 


TABLE B (Continued) 
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1939 1940 1941 1942 1943 1944 1945 1946 1947 1948 1949 1950 
4,821 6,213 9,155 9,208 10,408 10,515 8,274 13,427 17,825 19,868 16,506 21,683 
1,390 1,224 1,106 1,158 949 845 744 470 605 570 660 548 
6,211 7,437 10,261 10,366 11,357 11,360 9,018 13,897 18,430 20,4388 17,166 22,231 
29,033 32,076 40,964 52,036 63,197 66,183 63,109 68,884 81,220 89,993 87,392 96,547 
6,211 7,437 10,261 10,366 11,357 11,360 9,018 13,897 18,430 20,438 17,166 22,231 
35,244 39,513 51,225 62,402 74,554 77,543 72,127 82,781 99,650 110,431 104,558 118,778 
82.4 81.2 80.0 83.4 84.8 85.4 87.5 83.2 81.5 81.5 83.6 81.3 
17.6 18.8 20.0 16.6 15.2 14.6 12.5 16.8 18.5 18.5 16.4 18.7 
6,211 7,437 10,261 10,366 11,357 11,360 9,018 13,897 18,430 20,438 17,166. 22,231 
1,462 2,878 7,846 11,665 14,406 13,525 11,215 9,583 11,940 13,028 10,989 18,593 
7,673 10,315 18,107 22,031 25,763 24,885 20,233 23,480 30,370 33,466 28,155 40,824 
29,033 32,076 40,964 52,036 63,197 66,183 63,109 68,884 81,220 89,993 87,392 96,547 
36,706 42,391 59,071 74,067 88,960 91,068 83,342 92,364 111,590 123,459 115,547 137,371 
79.1 75.7 69.3 70.3 71.0 72.7 75.7 74.6 72.8 72.9 75.6 70.2 
20.9 24.3 30.7 29.7 29.0 27.3 24.3 25.4 27.2 27.1 24.4 29.8 
6,211 7,437 10,261 10,366 11,357 11,360 9,018 13,897 18,430 20,438 17,166 22,231 
3,442 3,508 3,908 4,472 5,074 5,834 5,925 4,257 5,286 6,348 7,186 7,896 
9,653 10,945 14,169 14,838 16,431 17,194 14,943 18,154 23,716 26,786 24,352 30,127 
29,033 32,076 40,964 52,036 63,197 66,183 63,109 68,884 81,220 89,993 87,392 96,547 
38,686 43,021 55,133 66,874 79,628 83,377 78,052 87,038 104,936 116,779 111,744 126,674 
75.0 74.5 74.3 77.8 79.4 79.4 80.9 79.1 77.4 77.1 78.2 76.2 
25.0 25.5 25.7 22.2 20.6 20.6 19.1 20.9 22.6 22.9 21.8 23.8 
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TABLE C 
LABOR AND PROPERTY SHARES: THE CLAIMS CONCEPT OF INCOME 


1929-1950 
(money figures in millions of dollars) 





















1929 1930 1931 1932 1933 1934 1935 1936 1%; 





Property Sector: 
re 21,755 14,381 9,359 5,655 7,836 9,052 10,227 12,530 13.4 
2. Net Cap. Gains. Corps.......... 816" -—290 —1,404 -—1,563 —1,423 — 55 231 439 
3. Net Cap. Gains, Indiv.......... 2,893 -—1,360 -—2,718 —2,682 —1,403 — 630 —89 589 =| 
4. Total Property (Concept I)..... 25,464 12,731 5,237 1,410 5,010 8,367 10,369 13,558 13,4 
Labor Sector: 
5. Total Labor Income............ 64,571 56,723 47,073 35,452 35,062 40,724 47,015 52,737 59% 
6. Less: Contrib. for Soc. Ins... ... 371 377 383 391 388 427 462 767s 1,@ 
Pc hacdiaanalnaammai ae sacar 64,200 56,346 46,690 35,061 34,674 40,297 46,553 51,970 58,08 
8. Transfer Payments............. 1,499 1,544 2,673 2,152 2,113 2,193 2,389 3,520 
9. Total Labor (Concept I)........ 65,699 57,890 49,363 37,213 36,787 42,490 48,942 55,490 60,4) 
10. Total Property (Line 4)........ 25,464 12,731 5,237 1,410 5,010 8,367 10,369 13,558 13,41 
11. Total Income (Concept I)....... 91,163 70,621 54,600 38,623 41,797 50,857 59,311 69,048 73,84 
Ratios: Concept I 
12. Total Labor (Line 9)........... 72.1 82.0 90.4 96.3 88.0 83.5 82.5 80.4 8 
13. Total Property (Line 10)....... 27.9 18.0 9.6 3.7 12.0 16.5 17.5 19.6 18! 
| 
14. Total Property (Line 4)........ 25,464 12,731 5,237 1,410 5,010 8,367 10,369 13,558 13,4 
15. Add: Cap. Cons. Allowances.... 8,816 8,747 8,312 7,663 7,245 7,218 7,369 7,684 | 7,2 
16. Total Property (Concept III).... 34,280 21,478 13,549 9,073 12,255 15,585 17,738 21,242 
17. Total Labor (Line 9)........... 65,699 57,890 49,363 37,213 36,787 42,490 48,942 55,490 
18. Total Income (Concept III)..... 99,979 79,368 62,912 46,286 49,042 58,075 66,680 76,732 
Ratios: Concept III 
19. Total Labor (Line 17).......... 65.7 72.9 78.5 80.4 75.0 73.2 73.4 72.3 
20. Total Property (Line 18)....... 34.3 37.1 21.5 19.6 25.0 26.8 26.6 27.7 





Source: Department of Commerce, National Income, 1981: capital gains from Lawrence H. Seltzer, The Nature and Tax Treat 
of Capital Gains and Losses (National Bureau of Economic Research, 1951), pp. 367, 531. 

® Estimated from Department of Commerce, National Income, 1951, Table 38, p. 202. 

> Includes employer contributions for social insurance and for private pension funds and employee contributions for soci ® 
surance. 
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TABLE © (Continued) 












1938 1939 1940 1941 1942 1943 1944 1945 1946 1947 1948 1949 1950 








13,887 15,462 19,110 20,239 22,250 23,243 21,430 27,855 33,530 37,026 34,410 40,909 
76 —672 -956 -—176 -160 63 641 = 1,269 924 975" 
—269 —437 -—896  —380 1,057 1,602 4,267 6,644  4,450° 4,325° 
13,694 14,353 17,258 19,683 23,147 24,908 26,338 35,768 38,904 42,326 


59,268 64,498 81,428 108,308 136,097 150,230 154,386 154,292 164,900 180,312 173,158 190,849 
2,292 2,452 2,967 3,715 4,908 5,897 7,019 7,222 7,268 7,084 7,698 9,379 
56,976 62,046 78,461 104,593 131,189 144,333 147,367 147,070 157,632 173,228 165,460 181,470 
2,963 3,119 3,119 3,150 2,971 3,597 6,179 11,420 11,803 11,285 12,352 15,082 
59,939 65,165 81,580 108,043 134,160 147,920 153,546 158,490 169,435 184,513 177,812 196,553 
13,694 14,353 17,258 19,683 23,147 24,908 26,338 35,768 38,904 42,326 

73,633 79,518 98,838 127,726 157,307 172,828 179,884 194,258 208,339 226,839 


81.4 81.9 82.5 84.6 85.3 86.1 85.4 81.6 81.3 81.3 
18.6 18.1 17.5 15.4 14.7 13.9 14.6 18.4 18.7 18,7 


13,694 14,353 17,258 19,683 23,147 24,908 26,338 35,768 38,904 
8,101 8,440 9,294 9,981 10,680 11,887 12,410 12,163 14,845 
21,795 22,793 26,552 29,664 33,827 36,795 38,748 47,931 53,749 
59,939 65,165 81,580 108,043 134,160 147,920 153,546 158,490 169,435 
81,734 87,958 108,132 137,707 167,987 184,715 192,294 206,421 223,184 


73.3 74.1 75.4 78.5 79.9 80.1 79.8 76.8 75.9 
26.7 25.9 24.6 21.5 20.1 19.9 20.2 23.2 24.1 





* Estimated from Treasury Department, Statistics of Income. It has been assumed, for 1947 and 1948, that short term gains are 
igible. Therefore, the net gains as reported for tax purposes have been doubled to produce the estimates here. 
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TABLE D 


LABOR AND PROPERTY SHARES, 1919-1929 
(money figures in millions of dollars) 








1919 1920 1921 1922 1923 1924 1925 1926 1927 1928 1929 





Property Sector: 


ee eee ee 4.0 43 45 4.9 5.2 656 5.5 65.1 5.1 4.9 4.9 

DS. TEED oo cccsve 2.9 3.2 3.0 3.0 3.8 3.8 4.4 4.7 5.1 5.5 6.3 

De BR ccc ciccscce 3.2 3.7 3.9 40 4.2 4.4 46 4.7 4.9 5.3 5.6 

4. Corp. Net Savings. 1.0 2.2 ooh -23 -97 -42 83 2.3 .56 -92 1.5 

5. Total Property.... 11.1 13.4 12.11 12.13 14.17 14.22 15.33 16.8 15.66 16.62 18.3 
Labor Sector: 


6. Wages & Salaries... 36.7 43.3 34.9 36.4 42.7 42.7 44.4 47.4 47.8 48.7 51.5 


7. Other Pay. to Basi. .43 57 .60 -60 -62 -62 -61 .62 -65 -66 .69 
8. Entre. Withdr..... 11.8 13.5 10.3 10.8 11.3 11.9 12.5 12.5 12.6 12.9 13.4 
9. Entre. Savings..... 6.5 1.6 .63 —.09 1.2 .87 1.6 2.1 i wa 2.1 
10. Total Labor....... 54.43 58.97 46.43 47.71 55.82 56.09 59.11 62.62 62.15 63.17 66.69 
11. Total Property 

EI cts tedect: dhe 11.1 13.4 12.11 12.13 14.17 14.22 15.33 16.8 15.66 16.62 18.3 
12. Total Income...... 65.53 72.37 58.54 59.84 69.99 70.31 74.44 79.42 77.81 79.79 84.99 

Ratios: 

13. Total Labor (Line 

EPP CE rT 83.1 81.5 79.3 79.7 79.8 79.8 79.4 78.8 79.9 79.2 78.5 
14. Total Property 

ee 16.9 18.5 20.7 20.3 20.2 20.2 20.6 21.2 20.1 20.8 21.5 





Source: Simon Kuznets, National Income And Its Composition, 1919-38, Vol. I (National Bureau 
of Economic Research, 1941), pp. 216-17. 


SUMMARY 


The major conclusions which may be drawn from the foregoing in- 
vestigation of labor and property shares in total income are the follow- 
ing: 

1. Since the Twenties labor shares in total income have tended gen- 
erally to increase. The recovery period of the Thirties shows labor 
shares higher than in the Twenties, and higher in World War II than 
in the Thirties. However, the postwar period has tended to reduce labor 
shares very nearly to their position in the late Thirties. 

2. The available data on the distribution of income by size class, 
when compared with changes in labor and property income shares, 
would seem to indicate that increased labor shares are generally associ- 
ated with reduced concentration in the distribution of income. 

3. As economic activity declines, labor shares tend to increase. As 
economic activity increases, labor shares tend to decline. A decline in 
labor shares (except for World War II) seems to be accentuated by 
inflationary conditions. 

4, The measurement of income on the basis of claims, rather than on 
the basis of primary distribution, alters somewhat the relative change 
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in labor and property income since the Twenties, but does not alter the 
relationship between changes in economic activity and changes in 
labor’s relative share. 

5. Where the measurement of income includes capital consumption 
allowances, relative shares appear to be considerably more stable be- 
tween prosperity and depression. 

The conclusion which is of most general theoretical interest is the 
third—that increases in economic activity are associated with a reduc- 
tion in labor shares. This point is particularly significant if a reduction 
in labor shares is linked to increased conceniration in the distribution 
of income. For this will mean that, if the marginal propensity to con- 
sume of property income recipients is lower than the marginal propen- 
sity to consume of labor income recipients, increases in economic ac- 
tivity will tend to increase savings ratios and, therefore, increase the 
volume of investment necessary to maintain the higher level of income. 
Since inflation appears to accentuate the shift to property income, it 
may be said that the inflationary process itself makes it more difficult 
to maintain prevailing levels of national income. Unfortunately, our 
present knowledge of consumption ratios by income class does not per- 
mit a final assessment of the significance of this point. We do not now 
know the range of variation in the marginal propensity to consume by 
income class or by type of income. | 

It is also of considerable interest that the “normal” pattern of rela- 
tionship between labor and property shares and changes in economic 
activity was substantially altered during World War II. The increase in 
economic activity during this period was associated with an increase, 
not a decrease, in labor’s share of total income. A comparison of this 
experience with the experience of the years 1946-48 and 1950, when 
labor shares were declining, suggests that the effective stabilization 
program of the war years may be responsible for the large increases in 
labor shares of total income in these years. 











TH™ POST-ENUMERATION SURVEY OF THE 1950 CENSUS: 
A CASE HISTORY IN SURVEY DESIGN 


Eu1 8. Marks, W. PARKER MAULDIN AND HAROLD NISSELSON 
Bureau of the Census 


HE primary purpose of the present paper is to trace the develop- 

ment of the Post-Enumeration Survey, which was designed to 
measure error in the 1950 Censuses, and to outline the reasons for the 
decisions made and the alternatives considered in designing this survey. 
This paper points up the extent to which decisions on survey design 
have to be reached on the basis of intuition and opinion, due to the 
absence of any satisfactory objective data. Such analysis directs atten- 
tion to the major gaps in the knowledge of survey technique—the 
spots where there is need to supersede art with science—and may raise 
as subjects for investigation, points which have been implicitly ac- 
cepted as axiomatic. For example, discussions of field methods usually 
devote considerable space to improving the selection and training of 
interviewers but practically no attention is given to the problem of 
whether the gains from such improvements are commensurate with the 
effort and expenditure involved. 

The existence of errors in a Census is readily apparent in view of the 
vastness of the undertaking, with its necessarily large and (in large 
part) inexperienced enumerating staffs who are called on to obtain facts 
on a wide variety of subjects from a group of respondents who may be 
given only a partial understanding of what is wanted and who may 
have a very incomplete knowledge of the information requested. In 
addition to these and other problems in the collection of the data, there 
are also human and mechanical failures in the processing of the data 
that are not always detected. 

The recognition of the existence of such errors is not new. For many 
years, the Census volumes have contained introductory statements 
pointing out inconsistencies and possible sources of error in the statis- 
tics. Also, analyses of error in, and questions regarding the accuracy of, 
Census data can be found in numerous articles, a few of which are 
listed at the end of this paper. 

‘ What is relatively new is the attempt to provide a measure of the 
errors on the basis of an independent enumerative check. Checks of 
this sort were conducted in connection with the 1945 Census of Agricul- 
ture, the 1947 Census of Manufactures, and the 1948 Census of Busi- 
ness. These experiences formed the basis for planning the 1950 Census 
Post-Enumeration Survey. 
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The design of the Post-Enumeration Survey was premised on a need 
for a high level of accuracy—a level substantially higher than required 
or attained in the Census. Since the Census was taken with considerable 
attention to securing accurate results, such a requirement was quite 
rigorous. On the other hand, it was agreed that perfection was neither 
attainable, nor necessary, in order for the results to be useful. 

Not only was there the desire to measure with precision the 
amount of error, but also, to the extent possible, to determine the 
reasons for error. This second feature is particularly significant since 
exploration of the sources of error should provide our most valuable 
guide to improving future surveys. It can be seen that a study of “re- 
sponse variation” would not answer these purposes. Measures of re- 
sponse variation—i.e., the differences which occur when identical ques- 
tions are asked under (presumably) identical conditions—help to point 
out weaknesses in our data. They do not, however, throw much light 
on how to correct these weaknesses. The fact that two responses differ 
indicates that one (or both) of the responses is in error but does not 
indicate which one (if either) is correct. 

A brief statement indicating the organization of the Bureau may 
help in understanding how the Post-Enumeration Survey was planned 
and conducted. Under the Director and Deputy and Assistant Direc- 
tors, there are a number of Divisions. Those concerned in the Decennial 
Census, and hence in the Post-Enumeration Survey, were the Agricul- 
ture Division, the Field Division, the Geography Division, and the 
Population and Housing Division. The Office of the Assistant Director 
for Statistical Standards was given the responsibility for coordinating 
these interests. All major issues were reviewed and decisions formulated 
or revised in a committee consisting of representatives of all these or- 
ganizational units. In addition, valuable advice and assistance was re- 
ceived from others outside the Bureau. This paper is, necessarily, a 
report not of work done by a single individual, but of decisions and 
plans formulated by a group. 

In preparing the paper, minutes of meetings and other working rec- 
ords have been consulted but many of the gaps in these records (par- 
ticularly in the area of alternatives considered) had to be filled in from 
memory. Of necessity, the description of the survey is telescoped and 
many important details have been omitted. A complete description of 
the checking methods used in the Census, or even of the Post-Enumera- 
tion Survey itself has not been attempted. Materials of this type will 
be incorporated in the publications giving the results of the Census and 
Post-Enumeration Survey. 
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SOME MAJOR FACTORS IN THE SURVEY DESIGN 


The Post-Enumeration Survey was primarily concerned with: 
A. Coverage errors, consisting of: 


(1) Omissions of persons, households, dwelling units, and farms. 
(2) Duplications of persons, households, dwelling units, and 
farms. 
(3) Inclusion of persons, households, dwelling units, or farms, 
but in the returns for the wrong area. 
B. Content errors, consisting of omissions or incorrect entries as re- 
sponses to the specific questions on the schedules. 


One of the first problems that had to be met was that of sample de- 
sign. In designing the sample several factors had to be considered: 

(1) Measurement of omissions. For most purposes, the Census pro- 
vides a satisfactory listing of the population. It cannot, of course, pro- 
vide a satisfactory listing for the purpose of measuring its own com- 
pleteness and, in general, any other source would be even less satis- 
factory. Thus, it was necessary to find some technique for drawing a 
sample of those units (persons, households, families, farms) which were 
not listed in the Census. The only technique available appeared to be 
to draw a sample of small areas (segments) and make special efforts in 
the Post-Enumeration Survey to list all units which were in the seg- 
ment at the time of the Census. It could then be determined which 
units were enumerated by comparing the Post-Enumeration Survey 
segment listings with the Census listings. 

(2) Measurement of duplications and inclusions in the wrong area. 
Checking “coverage” of the Census involves both the completeness of 
the enumeration and its accuracy. The segment sample would provide 
a measure of completeness. For checking accuracy of coverage, how- 
ever, it was also necessary to determine how many units had been 
erroneously included in the Census listing. To identify erroneously-in- 
cluded units by comparison of the Post-Enumeration Survey segment 
listings with the Census listings would have required constructing a 
Census listing for the segment. While all units in the Census can be 
identified as being located in a particular Enumeration District (the 
area assigned to a particular enumerator), the Post-Enumeration Sur- 
vey segments were considerably smaller than an Enumeration District. 
In many cases (approximately 20 per cent of the total), addresses 
shown within Enumeration Districts are not specific enough to identify 
the location by segment. Thus, any segment listing constructed from 
the Census would inevitably contain units which had been enumerated 





1953 


ns. 


nd 


Me Oe 





THE POST-ENUMERATION SURVEY OF THE 1950 CENSUS 223 


in the correct Enumeration District but which were not actually lo- 
cated in the segment to which they had been assigned. This would re- 
sult in an overestimate of the number of erroneously included units and 
would also increase the sampling variance of estimates of net coverage 
error (i.e., the difference between erroneously omitted and erroneously 
included units). 

An alternative would have been to estimate the number of errone- 
ously included units by subtracting an estimate of the number of units 
in the Census which were enumerated in the correct place (as deter- 
mined from the Post-Enumeration Survey interviewers’ segment list- 
ings) from the total number of units enumerated in the Census. Assum- 
ing a fixed total expenditure, however, an estimate obtained in this 
manner would be subject to a considerably larger sampling error than 
one obtained by sampling the Census listings and directly identifying 
the units erroneously included. 

In addition, the indirect technique would not give any clues regard- 
ing the reasons for over-enumeration, i.e., how many over-enumerations 
were a result of carelessness, how many were a result of complexities or 
ambiguities in the rules determining who should be enumerated, etc. 
The segment technique would require a third visit to determine the 
reasons for over-enumerations. 

Thus, it was decided to use two samples, a segment sample for units 
which were missed and a sample of Census listings to measure erroneous 
enumerations. Since an interview to check on over- or under-enumera- 
tion was necessary for every unit in both samples, it would have been 
desirable to overlap the samples to the maximum extent possible. Com- 
plete overlap would have required that one be able to identify in ad- 
vance the segment in which each Census listing was located. Although 
this was impossible for the reasons discussed above, an effort was made 
to obtain a high degree of overlap. 

(3) Measurement of content error. With this design, the units which 
were defined correctly (e.g., a farm which existed, which should have 
been enumerated and which was enumerated in the correct area, but 
not necessarily enumerated with the proper characteristics) might have 
been obtained from either or both samples. Information from such units 
was desired to determine whether their characteristics had been cor- 
rectly reported. 

Where overlap of the two samples was not achieved, it was necessary 
either to take complete interviews on all units in both samples and 
adopt a complex weighting system or to restrict the check on accuracy 
of characteristics for correctly included units either to those in the “seg- 
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ment sample” or those in the “list sample.” Complete interviews for all 
units would, of course, have been more expensive than restricting com- 
plete interviews to only one of the samples (particularly when the cost 
of weighting is included in the expenditures). The additional expendi- 
ture would not have been justified by the increase in information ob- 
tained. Because it was considered desirable to furnish the check inter- 
viewer with the information obtained in the Census for a given unit 
(see the discussion below of “Transcription”), it was decided to restrict 
complete interviews to units in the sample of Census listings since they 
could be identified in advance. It was also necessary, of course, to ob- 
tain complete information from the “missed” units in the segment 
sample. 

(4) Length of interview. In designing the Post-Enumeration Sur- 
vey interview, emphasis was placed on investigating each variable 
thoroughly even if this meant restricting the number of variables to be 
checked. After the elimination of a great many Census variables from 
the Post-Enumeration Survey, it was found that the interview would 
still be quite extensive if the remaining variables were to be checked 
thoroughly. This did not present any serious problem except in farm 
households where it might be necessary to check the definition of the 
dwelling unit and of the farm, the completeness and correctness of the 
listing of persons, the housing characteristics of the dwelling unit, the 
farm characteristics, and the characteristics of each member of the 
household. Experience in a pretest indicated that a full interview for 
farm households would average 1? hours and that the upper quartile 
would exceed two hours. It was felt that interviews of this length would 
be undesirable. 

The alternatives were (a) to drop additional items from the check, 
(b) to check some (or all) of the items less thoroughly, or (c) to check 
some of the items in one sample and others in another—in particular 
to separate the farm check from the check on population and housing 
characteristics. Dropping items from those originally selected was not 
an acceptable solution since appreciable savings in interview time could 
be made only by eliminating items which were of primary importance. 
A less thorough check was also undesirable—more, rather than less, 
investigation might be necessary to insure results of the required ac- 
curacy. Checking different items in different samples was feasible but 
might mean more complex instructions to the field staff, complications 
in processing the data, increased cost or increased sampling error and, 
possibly, increase in the error from sources other than sampling varia- 
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tion. Despite its disadvantages, this alternative did permit cutting the 
length of interviews in farm households without affecting the interviews 
for nonfarm households. 

An additional factor in the decision was the complexity of the job 
facing the interviewer—he would have to master the definition of a 
dwelling unit and of a place requiring a farm check; the definitions of 
more than 100 different Census variables; the problems of defining 
“usual residence” and “farm headquarters”; the special procedure for 
occupied living quarters which did not meet the dwelling unit definition 
and for other places which were by definition not dwelling units; the 
use of maps and aerial photographs; the preparation of rough detail 
maps; the methods of recording on seven different forms; how to inter- 
view and how to probe for causes of discrepancies between his own re- 
sults and those obtained by the Census enumerator; and the various 
special modifications of the procedures required by subsidiary studies 
which were linked to the Post-Enumeration Survey. 

In the light of these considerations, it was decided to draw three 
samples in rural areas: the segment sample to be checked to locate 
missed households and farms; a sample of Agriculture Census listings 
to be checked for over-enumeration and for farm characteristics (the 
farm sample was overlapped with the segment sample as much as possi- 
ble); and an independent sample of listings from the Population and 
Housing Census to be checked for over-enumeration on population and 
housing characteristics. The segment and farm samples were assigned 
to rural interviewers and the household sample to urban interviewers. 
The split was not made in urban areas where farm checks would be 
relatively infrequent. 

Although the decision was predicated in part on the assumption that 
length of time required for a complete interview in farm households 
would be excessive, this is by no means an established fact. The length 
might have been feasible and, even if it were not, there were other 
means of reducing interview length—actually it was later decided (see 
below) to subsample individuals within households which cut the length 
of interview per household, but in view of the other considerations and 
the pressure of circumstances, the decision to check agriculture items 
and population and housing items in separate samples was not re- 
examined. 

(5) Selection, training and supervision of interviewers. The objectives 
of the Post-Enumeration Survey required that interviewers be care- 
fully selected, well-trained and competently supervised. With a fixed 
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budget, increasing the number of interviewers meant either reducing 
the training and supervision expenditures per interviewer or cutting at 
some other point. 


INITIAL DECISIONS ON SURVEY DESIGN 


As a first step in design of the sample, comparisons were made of 
estimated costs of different designs. Preliminary computations pointed 
to use of a design involving 300 primary sampling units. Drawing of a 
sample on this basis was, therefore, initiated. A two-stage sample was 
drawn, drawing primary sampling units consisting of a county or a 
group of contiguous counties and within the primary sampling unit 
selecting segments containing 6 households in urban areas or 10 house- 
holds in rural areas. Later recalculation as shown in Table I indicates 
minimum cost for 200 primary sampling units. With the rough assump- 
tions made about costs and intraclass correlations, the differences be- 
tween 200 and 300 primary sampling units (with the same staffing pat- 
tern) are not of magnitude sufficient to form a basis for choice and the 
design with 300 primary sampling units actually used is presumed to be 
near the minimum cost design. 

Final computations made to compare different survey designs are 
summarized in Table I. Designs 1 to 7 assume that all interviewers 
would conduct population, housing and agriculture interviews. Designs 
8 to 13 assume that one set of interviewers (urban) would conduct 
population and housing interviews and another set (rural) would con- 
duct agriculture interviews, within the same sample of primary sam- 
pling units but in different segments. Designs 2, 3, 6, and 7 show differ- 
ences in cost associated with different numbers of primary sampling 
units in the situation where each interviewer would be assigned one 
primary sampling unit. Designs 1 and 4 indicate the case where each 
primary sampling unit is split between interviewers. In this case, there 
is a substantial increase in training and supervision costs with no com- 
pensating economies (under the assumptions made). In Design 5 there 
is only 1 interviewer for each 2 primary sampling units, which involves, 
therefore, an increase in travel and enumeration costs (mainly due to 
the fact that the interviewer would be paid subsistence allowance while 
working away from his home primary sampling unit) but this increase 
is compensated by a decrease in the training and supervision costs. 
Designs 8 through 13 show similar variations in the more complex situa- 
tion where two types of interviewers are used and the ratios of rural 
interviewers per primary sampling unit and urban interviewers per 
primary sampling unit are varied. 
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Within the selected primary sampling units, it was decided to use 
segments containing about 6 households in urban areas and about 10 
households in rural areas. From a standpoint of sampling variance, this 
size of segment may be too large for maximum efficiency. On the other 
hand, smaller segments would have meant problems in obtaining clear 
definition of boundaries, thus introducing “coverage” errors into the 
Post-Enumeration Survey. 


SPECIALIZATION OF INTERVIEWERS 


The decision to separate the agriculture check from the check of 
population and housing and to specialize the interviewers has been 
noted above. While estimates of the cost of such specialization are avail- 
able (see Table I), no estimates of the gains have been made—Table I 
presents cost for a fixed sampling error but does not take into account 
other types of error. It was anticipated that the complexity of the inter- 
viewer’s job might result in some confusion on his part so that his per- 
formance would be seriously impaired. It was also felt that, within the 
available training budget, intensive instruction on all phases was im- 
possible unless the scope of the job was reduced. Thus, the anticipated 
gain from specializing interviewers was in accuracy and in reduction of 
the cost of correcting inadequate work. The magnitude of these gains 
could not be estimated even roughly in advance, and the survey itself 
yields only impressions on this point. While the decision made assumes 
implicitly that the gain was worth an estimated 20 to 30 thousand 
dollars additional cost (the difference in cost as shown in Table I is 
approximately $35,000 to $45,000, but part of this is chargeable to 
specialization and part to the additional sample in rural areas), this is a 
purely intuitive assumption. Subjective impressions indicate that in- 
terviewer training and initial performance were better than might have 
been obtained without specialization. 


TRANSCRIPTION 


Obviously the sample design affected (and was affected by) almost 
all phases of the Post-Enumeration Survey. Another problem which 
had considerable ramifications was the question of whether or not the 
re-interviewer should or should not be provided with the results of the 
original enumeration. 

There are definite advantages (at least in theory) to giving the check- 
ers the original data: (1) it permits the re-interviewer to check his own 
answers against the original entry and to determine more positively 
what the correct answer is; and (2) important leads can be obtained for 
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the improvement of future surveys by having the re-interviewer set 
down as much information as possible about the reasons for any errors 
found. On the other hand, there may be danger of minimizing the ex- 
tent of error due to the interviewer ‘‘confirming”’ the original entry even 
where it is wrong. In an early Census pretest there was evidence that 
this occurred. However, the post-enumeration survey interviewers in 


TABLE II 


PER CENT OF POPULATION AND HOUSING SCHEDULES SHOWING 
A DIFFERENCE BETWEEN ORIGINAL AND RE-INTERVIEW 
ENTRIES, POST-ENUMERATION SURVEY OF MAY 1949 
CENSUS PRETEST 











Group A* Group B* 
(Census Entries Available | (Census Entries Not Avail- 
Item to Post-Enumeration able to Post-Enumeration 
Survey Interviewers) Survey Interviewers) 

Age 29 25 
Place of Birth Tt Tt 
Work Status 11 8 
Occupation 29 31 
Industry 14 19 
Class of Worker 25 23 
Highest School Grade 41 38 
Condition of Unit 19 18 
Water Supply 6 15 
Bathtub or Shower 2 8 
Tenure 11 18 











* Percentages for housing items based on 54 Group A and 60 Group B cases, for highest school 
grade on 37 Group A and 34 Group B cases, and for other items on 100 to 200 cases. Figures for Group 
A represent the differences remaining after comparison and reconciliation between the re-interviewer’s 
initial entry and the figure transcribed from the original enumeration. Blank or omitted entries were not 
counted as differences. 

t Less than one per cent. 


this pretest were not specially selected or trained, and little restriction 
was placed on their use of the original data. 

The technique finally evolved for use of the original data in the Post- 
Enumeration Survey gave the interviewer the information obtained in 
the Census for the “characteristics” of persons, households, and farms. 
He was not to look at this information until he completed the inde- 
pendent interview and, to reinforce this instruction, the original Census 
entries were transcribed to a page which folded in so that inadvertent 
examination of the entries was practically impossible. After the inter- 
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viewer completed his independent questioning, he compared his entries 
with the Census data and attempted to reconcile the differences by 
further questioning of the respondent. 

In a test of the effectiveness of this procedure, conducted following a 


TABLE III 


NUMBER OF AGRICULTURE QUESTIONNAIRES SHOWING A 
DIFFERENCE BETWEEN ORIGINAL AND RE-INTERVIEW 
ENTRIES, POST-ENUMERATION SURVEY OF MAY 
1949 CENSUS PRETEST 

















Group B 
Group A s (Census entries not 
(Census entries available to email te, Dust, 
Post-Enumeration Survey Enumeration Survey 
interviewers) interviewers) 
Item 
No. of differences 
No. of Bafere After No. of No. of 
reports ssnendiite..| sesenaiite, reports | differences 
tion* tion* 
Corn 
acres 24 14 12 22 15 
bushels 23 19 18 22 17 
Cotton 
acres 23 12 10 15 9 
bales 23 10 9 15 8 
Fruit and nut trees 31 | 7 24 5 
Number of caitle 27 9 5 22 6 
Chicken eggs, sold 
dozens 18 16 11 16 14 
value 18 16 11 16 15 
Work off the farm 33 14 14 26 11 




















* Figures in the column “Before reconciliation” represent differences before the re-interviewer had 
examined the data from the original enumeration. Figures in the column “After reconciliation” indicate 
number of differences remaining after comparison and reconciliation between the re-interviewer’s initial 
entry and the figure transcribed from the original enumeration. 


Census pretest in May 1949, the interviewer was not given the original 
Census information for one-half of the sample cases. The results (see 
Tables II and III) indicated that the interviewer’s initial entries showed 
about the same proportion of differences from the Census entries re- 
gardiess of whether or not he was given the original Census information. 
In some cases, however, the reconciliation disclosed errors in the inter- 
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viewer’s initial entry which he could then correct. Because of the small 
size of the sample and difficulties in defining the population which it 
represents, dependable generalizations cannot be made from these data. 
However, the data would indicate that it was possible to secure the 
advantages cited above of supplying re-interviewers with the original 
Census data without serious effects in the direction of “confirming” er- 
roneous entries. As further control for a small proportion of units (5 per 
cent of the households and 10 per cent of the farms) in the Post-Enu- 
meration Survey sample, no transcription data were supplied the re- 
interviewer. 


CHECKING ACCURACY OF INTERVIEWING 


The subsample in which the interviewer was not given the original 
Census data supplies one check on the accuracy of his work. The rela- 
tive number and magnitude of the differences when the interviewer is 
given the original Census data should be about the same as when he 
does not have the data, unless he has looked at the information instead 
of obtaining an independent answer. 

Another vital problem in the interviewing was making every reason- 
able effort to insure completeness of the Post-Enumeration Survey 
canvass. The basic procedure involved giving the interviewer a list of 
persons, households, and farms which he was to complete. As a check, 
it was decided to delete the names of some persons actually enumerated 
in the Census and see whether or not the Post-Enumeration Survey 
interviewer “picked up” those persons. The re-interviewer’s picking-up 
of the omitted person indicated that he was correctly carrying out the 
procedure for the detection of missed persons. It is, of course, no guar- 
antee that the procedure was effective in detecting missed persons. 

A sample of about 170 Post-Enumeration Survey segments was re- 
checked by professional personnel of the Bureau to determine the ac- 
curacy of the identification of missed households. The results of this 
re-check indicated that the Post-Enumeration Survey may have under- 
estimated the (net) number of missed dwelling units by 30 per cent. 
This estimate is, however, subject to a sampling error of 15 per cent. 
The effect of this deficiency in the Post-Enumeration Survey on the 
estimates of missed persons appears to be small since the omitted 
dwelling units were, in large part, vacant units or units with only one 
occupant. 

DESIGN OF THE INTERVIEW 


In developing the questions used in the Post-Enumeration Survey, 
the possibility of simply repeating the original Census questions was 
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considered and, quite early, rejected. It was felt that this procedure 
would give a measure primarily of “variance” rather than of accuracy. 
Every effort was made to incorporate checks on those features which 
seemed to be a source of error in previous surveys. In many cases, 
several questions were asked in the Post-Enumeration Survey to elicit 
data obtained by a single question in the Census. The purpose of mul- 
tiple questions was to prevent error by calling the attention of respond- 
ents and interviewers to any inconsistencies or misunderstandings and 
to assist the respondent toward a more complete recall of the desired 
information. The development of the questions on income illustrates 
this procedure. 

In the Census three questions were asked about income. These ques- 
tions referred to (1) wage and salary income, (2) income from own 
business, (3) other kinds of income (e.g., pensions, rent, dividends from 
stocks, bonds, etc.). Considerable experimentation was carried on with 
question wordings pertaining to income in various Census surveys and 
also in the pretest of the Post-Enumeration Survey. Most of the varia- 
tions in the wordings of income questions attempted to check on the 
specific errors which were thought to be most frequent. 

In a special study on housing in Baltimore, a different approach, the 
“job history” approach, was used to obtain a check on the accuracy of 
wage and salary income data. The essential feature of this approach is 
accounting for all periods of employment or unemployment during the 
year. For each job held, a series of “check” questions on income is 
asked and for each period of unemployment, questions are asked about 
unemployment compensation. The job history approach makes for a 
considerably longer interview but the results of the Baltimore study 
indicated that the job history approach detects income which may be 
omitted when other techniques are used. Consequently, for the Post- 
Enumeration Survey, the job history approach was adopted. 

Another important consideration in design of the interview was the 
specification of the respondent. In the Census, the interviewer was per- 
mitted to obtain information from any one of several qualified respond- 
ents. It was suggested that the interviews in the Post-Enumeration 
Survey be obtained from the person himself (except for minor children, 
mentally incompetent individuals and similar cases). It was antici- 
pated that this would improve the accuracy of the responses (since the 
person himself is usually the best informed respondent for his own 
characteristics) but might be excessively costly and might also result 
in a higher non-interview rate. The costs and non-interview rates were, 
therefore, explored in the Post-Enumeration Survey pretests. These 
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pretests showed that the restrictions on the respondent resulted in no 
significant increase in non-interview rates and somewhere between 40 
and 50 per cent additional calls (and, presumably, about this increase in 
total travel and “contact” costs). It was decided that the cost increase 
would not be prohibitive and the Post-Enumeration Survey interview- 
ers were told to interview only the person himself (with the exceptions 
noted). No data are available for determining whether the increased 
costs were justified. It was proposed that a test of this point be in- 
corporated in the Post-Enumeration Survey but the proposal was re- 
jected as representing a substantial additional complication in both 
transcription and interviewing. 


TIMING OF THE POST-ENUMERATION SURVEY 


It was necessary to weigh a possible loss of accuracy if the Post- 
Enumeration Survey were started late against a loss if it were started 
early. If the interviewing were to be postponed, respondents would be 
questioned about an event which might be increasingly remote in time 
(and possibly in place). This is not serious for “static” items such as 
date of birth, but is very important for items whose recall tends to be 
obscured by subsequent events such as jobs held and income received 
during 1949. 

On the other hand, it would have been undesirable to start the Post- 
Enumeration Survey before the completion of the Census enumeration. 
Assurance that there would not be a bias in the Census enumeration of 
units in the Post-Enumeration Survey sample could be achieved only 
by making sure that these units were not distinguished in any way 
from other units during the Census. Theoretically, a given segment 
could have been started as soon as enumeration had been completed 
for the Enumeration District in which it was located. Actually, since 
Enumeration Districts, even with a restricted area such as a county, 
are completed at widely different times, there were two objections: 


1. Selection of interviewers from a given county would have been 
notification that this county was in the Post-Enumeration Survey 
before all work in the county had been completed. 

2. Before the Post-Enumeration Survey work in a segment could 
start it was necessary to transcribe information from the Census 
schedules. If this had been done in the local Census office, the 
local supervisor might have been motivated to have the Post- 
Enumeration Survey areas checked and the check might have re- 
sulted in alterations of the Census results. 
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Consideration of the dangers of bias led to a decision that transcription 
be done after all Census enumeration in a county had been completed 
and sent to the central Census processing sections in Washington and 
Philadelphia. 

In retrospect, the disadvantages of an early start seem as serious as 
they did at the time; but costs of delay, even beyond the delay antici- 
pated, seem to have been less serious, i.e., observation of the Post- 
Enumeration Survey interviewing suggests less recall loss and less diffi- 
culty in locating respondents than had been feared. 


SELECTION OF INTERVIEWERS 


It was generally agreed that the nature of the Post-Enumeration 
Survey required interviewers of top competence. What was meant by 
“top competence” and how to select such interviewers were matters on 
which it was almost impossible to secure agreement. Among the qualifi- 
cations one might list for a “good interviewer” are intelligence (or, more 
generally, “alertness”) and ability to get people to talk freely (secure 
“rapport”). For Census work, certain less obvious qualifications seem 
to be important. Coverage is essential and coverage requires meticu- 
lousness and persistence ; requires, in fact, an “over-conscientiousness” 
which may be in direct opposition to the characteristics ordinarily as- 
sociated with a “good interviewer.” Checking answers requires an in- 
terviewer who will critically evaluate each answer and will not accept it 
until he has probed the matter to the hilt. However, a critical attitude 
alone is insufficient—the respondent’s cooperation and interest is essen- 
tial and the interviewer must be able to secure these and still maintain 
a “cautious skepticism.” 

All of the above are qualifications hard to define and equally hard to 
recognize even if found. To set down criteria (with sufficient exactness 
to make them usable) was practically impossible. One suggested pro- 
cedure was direct recruitment by persons traveling out of the central 
office. Such persons might, it was felt, apply adequately criteria of the 
type cited without elaborate written specifications. Aside from the fact 
that such an advantage was only hypothetical and the fact that suffi- 
cient central personnel were not available and that there are counter- 
balancing advantages to field selection of the interviewers, considera- 
tions of morale in the field organization can be raised against such a 
procedure. It would seem desirable to let the immediate supervisor 
have final choice of the interviewers with the accompanying feeling of 
responsibility for their performance, i.e., not put the supervisor in the 
position of excusing poor performance as being the work of “a bunch of 
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deadheads foisted on him by the central office.” On the other hand, 
complete freedom of choice for the local supervisor might, in some cases, 
mean the placing of personal friendship or other considerations above 
basic qualifications for the job, particularly if interviewer selection were 
to be initiated before the supervisor had been given a complete picture 
of the scope of the job required. These considerations pointed to re- 
stricting the supervisor’s choice to those candidates meeting specified 
minimum requirements but, within this class, leaving the final choice 
in the supervisor’s hands. 

Another knotty problem was whether supervisors and interviewers 
should be selected only from persons with previous Census experience. 
With regard to the supervisors, there was unanimous agreement that 
previous Census experience was essential, since an inexperienced person 
would have his energies diverted from his major tasks by difficulties 
involved in mastering Census administrative procedures, which, how- 
ever, would be familiar to experienced Census supervisors. It was, 
therefore, decided to select Post-Enumeration Survey supervisors from 
experienced members of the Bureau’s permanent field staff. 

Familiarity with Census concepts and procedures was also desirable 
in the interviewers and selecting persons with Census experience would 
mean cost savings in interviewer training. It was also felt that among 
the 150,000 enumerators employed on the 17th Census, there must be 
many of outstanding competence and we might be more successful in 
locating these than in locating competent interviewers among candi- 
dates whom we had not had an opportunity to observe in an actual 
interviewing situation. 

As against the cost argument, there was the feeling that restriction 
of selection to Census enumerators ruled out classes of persons among 
whom there might be some highly desirable candidates (e.g., graduate 
students, teachers and professors, and experienced persons from other 
survey organizations available for the Post-Enumeration Survey dur- 
ing the summer but not for Census work in the spring). It was also felt 
that the problem of getting reliable reports on an individual’s per- 
formance as a Census enumerator was considerable since the enumera- 
tors worked for only a short time and there was little opportunity to 
review their work. 

Another objection to the use of interviewers who had worked on the 
Census was the possibility of bias if a person happened to be assigned 
to check his own work or work done under his supervision. The possible 
bias arising from an interviewer’s checking his own work is trivial, since 
the probability of cases from any Enumeration District being in the 
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sample was quite small (and most enumerators worked in only one or 
two Enumeration Districts). Bias due to the interviewer checking work 
which he supervised in the original enumeration might be more sub- 
stantial but could be (and was) avoided by not assigning any inter- 
viewer to any area for which he was responsible during the Census. 

Since there was no basis for assuming that one method of selection 
was better than another in terms of quality, it was finally decided on 
cost grounds to restrict the interviewer selection to persons with e- 
perience in the Census. This probably imposed only a relatively minor 
restriction (since only 260 interviewers were finally selected and the 
available pool was many thousand). On the other hand, it is by no 
means certain that another type of selection would have meant either 
substantially increased cost or lower quality. 

It was necessary, in large part, to abandon the attempt to spell out 
detailed selection criteria. As a first step, selection was restricted to 
persons who made a high score on the general selection test given to all 
candidates for enumerators in the regular Census. Additional criteria 
finally adopted were that (1) the interviewer score in the top two-thirds 
on a Post-Enumeration Survey selection test which measured ability 
to read and follow instructions and a schedule of the type actually used 
(but did not get at any of the personality characteristics cited above) ; 
(2) geographic distribution of interviewers be restricted so that about 
two-thirds of them would live in an area they were to enumerate; and 
(3) supervisors interview at least 4 candidates for every interviewer 
position. 


TRAINING OF INTERVIEWERS 


Considerable experience was available from the Census and other 
previous studies on techniques of interviewer training. Such techniques 
had apparently worked satisfactorily and it was decided (mostly as a 
matter of convenience) to use the same methods with emphasis on: 


1. The interviewer going through several actual interviews in the 
field under observation as part of his training— it was felt that 
this was important in making the “classroom” training more 
meaningful. 

2. Problem solution and the “role-playing” technique (here, the 
trainer acts as respondent and is interviewed by one of the 
trainees while all of the trainees record the answers). 


It had been decided to use as interviewers only persons with previous 
Census experience. However, consideration of the possibilities of inade- 
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quate original training and development of “bad habits” during their 
previous work, led to the decision that the Post-Enumeration Survey 
training repeat and (on basic points) amplify the Census training. The 
Post-Enumeration Survey interviewers were given considerable addi- 
tional material on maps and canvassing (which were felt to be impor- 
tant for good coverage) and on interviewing techniques, particularly 
the problems of verifying an answer and probing for reasons for dis- 
crepancies from the original Census response. Considerable emphasis 
was placed upon explaining to the interviewers the reasoning behind the 
phrasing of the questions and the design of the procedures. 

In training the supervisors they were first put through the training 
designed for the interviewers (combining, with appropriate modifica- 
tions, the two training courses planned for rural and urban interview- 
ers). Examination of this experience indicated that training of rural 
interviewers was likely to prove unsatisfactory unless the material was 
simplified or the length of the training extended. To avoid a longer 
training period, the procedures planned for the rural interviewing were 
reviewed. This called attention to the fact that in checking population 
and housing characteristics a rural interviewer was required to use the 
same questionnaires used by an urban interviewer. The rural inter- 
viewer filled out these questionnaires only for missed persons and 
missed dwelling units and, for this purpose, it was not necessary that 
the information be any more precise or detailed than that collected for 
enumerated units in the original Census. Actually, this rural procedure 
was a carry-over from the earlier planning before it was decided to 
split the interviewer’s job. Although time was short and the training 
and interviewers’ manuals were already printed, it was decided that the 
cost of revision would be less than the cost (and difficulties) of carrying 
out the original plans. A Census schedule which had been used for self- 
enumeration was, therefore, revised by adding questions regarding the 
usual residence at the time of the Census of presumptively missed per- 
sons and this schedule was adopted for use by the rural enumerators. 
Although the work of revising the schedule and manuals was consider- 
able (and the timing was hectic), the revision permitted simplification 
of the rural interviewer’s job and of his training. 


MODIFICATION OF THE SURVEY DESIGN 


Having made the above decisions, work was started on drawing the 
sample, preparing the necessary maps and control lists, designing and 
printing of questionnaires and drafting of instructions. However, the 
design adopted was to be viewed as only an initial working document. 
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While the survey could have been carried through as initially designed, 
facts brought out in the course of the work dictated important modifi- 
cations. It was, in fact, a conscious policy to maintain flexibility of 
plans up to the point where efficiency required that they be frozen. 

As indicated above, the sample was intended to yield segments of 
approximately 6 households per segment in urban areas and 10 house- 
holds per segment in rural areas. Figures for about the first 20 per cent 
of the segments drawn indicated an average size considerably larger 
than the expected 7.6 households per segment. The complete sample, 
before the subsegmenting described below, had an average size of about 
35 households per segment. About 10 per cent of the segments had over 
35 households with an average of about 300 households. This was due 
to the fact that, in many cases, although there was known to be a large 
number of households in an area, the available maps did not show the 
detail necessary for subdividing the area and it had to be selected as a 
whole (with a lower weight). In most surveys, this difficulty can be 
overcome by subsampling within the segment and interviewing the 
units in the subsample. For checking coverage, however, it was de- 
sirable to have every household in a segment interviewed. One solution 
would have been to have the Post-Enumeration Survey interviewer 
divide the larger segments at the time he visited them and cover only 
one randomly selected subsegment. The difficulties of getting the Post- 
Enumeration Survey interviewer to execute this task satisfactorily, 
along with his other duties, made this solution unacceptable. Another 
alternative was to do the subsegmenting as a separate operation with 
the Post-Enumeration Survey interviewers, but before their training 
for the actual interviewing. This was rejected because it meant recruit- 
ing the interviewers earlier than had been planned (which was unde- 
sirable both because it meant disclosing prematurely some of the areas 
included in the Post-Enumeration Survey and because it meant a very 
awkward timing). The solution finally adopted was to do the subseg- 
menting with personnel from the Bureau’s Geography Division, a solu- 
tion which avoided the problem of using the Post-Enumeration Survey 
interviewer and had the additional advantage of doing the job with 
exceptionally well-qualified persons. 

As the sample was drawn and detailed specifications of the procedure 
were developed, more precise estimates of total costs could be made. 
It became evident that total costs would exceed the budgeted amount 
unless the design was modified. Since the purpose of the Post-Enumera- 
tion Survey was to check the Census results, there was a strong commit- 
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ment to obtain the maximum accuracy on a given interview even if this 
meant a relatively high cost per case. This ruled out effecting economies 
at the risk of reducing quality (e.g., by permitting an interview with 
any responsible person instead of requiring an interview with the per- 
son himself). The alternative was to reduce the sample size. The most 
obvious way to cut costs was to eliminate some primary sampling units 
from the sample. The objection to this procedure was that it meant 
either completely re-drawing the sample (scrapping the work done to 
date) or ending up with strata of very unequal size which would have 
increased the sampling error disproportionately to the cost savings. 
However, oversampling of the primary sampling units in the West 
(introduced originally to permit separate estimates for this region for 
agricultural characteristics) could be eliminated without appreciably 
affecting the variance of national estimates. This decision was adopted 
and reduced the number of primary sampling units to 270. 

As originally designed the sample called for interviews with all per- 
sons in the sample households. While this procedure is relatively simple, 
it is not necessarily efficient. Furthermore, on some characteristics, 
(e.g., immigration status, education and income) the Census had ob- 
tained information only for every fifth person so that for these char- 
acteristics the Post-Enumeration Survey sample would have been 
one-fifth the size of the sample for other characteristics (e.g., age, 
birthplace, occupation). In view of the budget situation, it was felt that 
cutting the sample for those characteristics obtained of every person in 
the household would not be too serious. Therefore, those persons for 
whom the Census asked all questions and one-quarter of the remaining 
persons in the Post-Enumeration Survey sample households were re- 
tained in the sample. 

Another economy was effected by permitting Post-Enumeration Sur- 
vey interviewers to obtain information for missed units from any re- 
sponsible respondent. The aim of the restrictions on the respondent for 
enumerated units was to improve on the original Census data. With 


respect to the detailed information obtained for missed units, it was 


only necessary to attain the accuracy level required in the original 
Census. 


RECORD CHECKS 


The design of the Post-Enumeration Survey emphasized securing 
results of high accuracy. Nevertheless questions were raised about the 
validity of the Post-Enumeration Survey: How can one be sure this 
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technique would give a better answer? How can you check the errors in 
one survey by taking another? This line of questioning frequently 
ended with a suggestion that there were records (presumably authorita- 
tive) which would give a better check than a re-interview. 

However, almost all record-matching studies end with unmatched 
cases and cases where the match is questionable. The errors found for 
the matched cases do not provide an estimate for all cases unless one is 
willing to make arbitrary assumptions with regard to the errors among 
unmatched cases. Even where a match has been established, one cannot 
conclude that the records are necessarily accurate. Where a birth cer- 
tificate was filed soon after the birth, the date recorded on it is, in gen- 
eral, dependable. Where it was filed several years later, one might have 
considerably less confidence in the recorded birth date. The accuracy of 
record data varies from one source to another and also from item to 
item on the same record. A record in the Veteran’s Administration files 
may provide conclusive evidence that John Doe is a veteran but not 
that he was born on the date stated on that particular record. 

The above considerations sharply limit the utility of record checks. 
Where errors are large for the matched cases, however, a record check 
provides a valuable warning of unreliability of data and useful leads for 
further investigation of sources of error. 

It was decided to experiment with record checks for age, birthplace, 
citizenship status, highest grade of school completed, income, veteran 
status, and industry in which employed. In pretests of the Post-Enu- 
meration Survey, additional data were included which might be useful 
in checking these items, e.g., an inquiry regarding branch of service and 
serial number for use in checking veteran status. In completing the 
matches for the pretest data, our attention was called to additional 
items which would improve the efficiency and the accuracy of the 
match. These items were added to the Post-Enumeration Survey ques- 
tionnaire. After collection of the data, the check of highest school grade 
completed was dropped on the grounds that the effort involved would 
have been out of proportion to the expected success. 


IMPLICATIONS FOR FUTURE RESEARCH 


Although the Post-Enumeration Survey possessed certain distinctive 
features peculiar to its purposes and content, its history illustrates the 
complex decision process involved in survey planning. This decision 
process requires the balancing of cost and accuracy considerations both 
within a particular aspect of a survey and among different aspects. The 
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over-all cost of a survey and the accuracy of the estimates the survey 
provides are resultants of the costs and accuracies associated with a 
variety of processes—sample selection, interviewing, coding, tabulat- 
ing, etc. Since most surveys have a fixed over-all expenditure limit, 
increased expense to improve the accuracy of one phase will usually 
require curtailed expenditure and lowered accuracy in some other 
phase—for example, may require a decrease in sample size with conse- 
quent increase in variance. Between survey processes a balance is nec- 
essary; to achieve a reduction of 5 per cent in coding error at the ex- 
pense of decreased checking of the interviewing with an attendant in- 
crease of 10 per cent in interviewing error is hardly desirable, particu- 
larly if the interviewing error is already larger than the coding error. 
Within a given survey process—for example, interviewing—an in- 
creased expenditure for training must be balanced against an increase 
in expenditures for supervision, interviewer salaries, more detailed 
questions, etc., in terms of the relative effect of the various expenditures 
on the quality of the interview output. 

In the process of balancing errors and costs between different survey 
operations, it is important to distinguish between the “variable” errors 
which decrease with increasing size of sample and the biases—which 
are unaffected by the sample size. A survey design may be character- 
ized by its sampling variance and bias and, in addition, by the variance 
and bias arising from other sources, such as interviewer, respondent, and 
processing error. By “nonsampling variance” is meant the variability 
of errors around the average error which is the bias. Depending upon 
the relationship between variable errors and biases, increased expendi- 
ture per case may or may not lead to a decrease in the over-all error. For 
example: Suppose that Method A with an expenditure of $1.00 per case 
would, if applied to the whole population, give an error (bias) of .2 
year in an estimate of average schooling and if the estimate were 
based on a random sample of a single case, give an (expected) error of 
10 years. Assume, also, that Method B with an expenditure of $20 per 
case would give an error (bias) of .05 year if applied to the whole 
population and of 8 years if applied to a single (randomly selected) case. 
The difference between the two methods arises, let us say, from greater 
expenditures for training and checking on all survey processes in the 
case of Method B. If cost were not a factor, Method B would be prefer- 
able. In most cases, cost is a factor and operates differently depending 
upoa the total funds available; using a simple random sample, Method 
A would give a mean square error of .060 for a total survey budget of 
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$5,000 compared with a mean square error of .258 for Method B. With 
a total survey budget of $100,000, the relationship would be reversed, 
Method A showing .041 mean square error and Method B .015.! 

In planning a survey, it is not sufficient to know that one technique 
gives more accurate results for individual cases than does another tech- 
nique. The amount of the improvement and the relative costs of the 
two techniques must also be known. In addition, the relationship 
among the different phases of a survey may be extremely important. 
Since the error in a survey estimate is not a sum of the errors in the 
different survey operations but tends rather to be the square root of a 
sum of squares of errors, reduction of a given amount in a relatively 
large source of error has more impact on total error than an equal re- 
duction in a small source. For example, reducing the response bias in 
an estimate of average income from $250 to $200 will have more effect 
on total error than reducing the sampling error (standard deviation of 
the estimate) from $100 to $50. 

Reasonably satisfactory techniques are available for assessing the 
gain of a given sampling design in terms of its effect upon cost and 
accuracy, and for evaluation of processing operations. At present, tech- 
niques available for the assessment of a given interviewing procedure 
are not nearly so satisfactory. In some cases there may be evidence that 
one interviewing method gives greater accuracy than some other 
method. However, in almost all cases, we have had no specific determi- 
nation of how much improvement in accuracy is obtained by a given 
interviewing technique and cannot, therefore, determine whether the 
gain in accuracy justifies the increase in cost. Although present meth- 
ods for measurement of the error of an interview may fall far short of 
perfection in giving an objective evaluation of quality, they can often 
provide relative measures of reliability in the form of lower bounds. 
Such data are useful and usable in improving survey design. They also 
will provide a basis for improving techniques of measurement. The de- 
velopment of more satisfactory techniques for the measurement of in- 
terview error, and collection of data for the evaluation of specific survey 
methods, constitute in our opinion the most essential steps to be taken 
toward the improvement in survey design. 
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ON THE DISTINCTION BETWEEN ENUMERATIVE 
AND ANALYTIC SURVEYS* 


W. Epwarps DEMING 
Bureau of the Budget and New York University 


I, DEFINITION OF THE ENUMERATIVE AND ANALYTIC USES OF DATA 


urpose of the paper. Statistical data are supposedly collected to 

provide a rational basis for action. The action may call for the 
enumerative interpretation of the data, or it may call for the analytic 
interpretation. 

The aim here is to exhibit some of the consequences of failing to dis- 
tinguish between the enumerative and the analytic uses of data. This 
distinction is necessary in the statement of the aims of a survey, census, 
or experiment, in order that the plans for the collection of the data and 
for the tabulations may most economically meet the needs of the con- 
sumer, and it is equally important in the interpretation of data. 

Thus, to draw on a result from a later paragraph, information ob- 
tained in a complete census concerning every person in an area (e.g., 
on occupation, income, or education) still possesses for analytic pur- 
poses a sampling error that is actually about a quarter as great as the 
sampling error of a 6 per cent sample. The consequences are far-reach- 
ing. In using a census-table for analytic purposes, even though the 
figures come from a perfect complete count, it is therefore necessary 
to bear in mind that small numbers in a cell are unreliable in the sense 
that they have a standard error, just as if they had arisen in sampling, 
as indeed they did. Moreover, in the planning of a complete census, it 
is therefore imperative to use sampiing for every bit of information 
that is not necessary as an aid to complete coverage, or required to give 
detail for small areas (such as the block statistics). Name, relationship 
to the head, age, sex, marital stati... color are probably all necessary 
for the sake of completeness of coverage. These things, plus a few ques- 
tions on rent, tenure, year built, will provide the information required 
for the block statistics. 

To draw on another result, we shall see that it is often impossible to 
design a survey that will supply economically information for both 
enumerative and analytic purposes. For example, in a marketing sur- 
vey, the best design for an estimate of the number of people who prefer 
to use ground coffee at home, rather than soluble coffee, requires, for 





* Delivered at a conference on sampling conducted by the Institute of Statistics, University of 
North Carolina at the Blue Ridge Assembly, 21 July 1952. 
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greatest economy, one type of sample design; whereas a study of the 
reasons, or even of the difference in the two proportions, requires an- 
other design. One must be prepared to make some sacrifices in preci- 
sion, as it may not be economical to satisfy both aims simultaneously. 

The distinction between the enumerative and analytic uses of data.' 
Briefly, the enumerative question is how many? The analytic question 
is why? is there any difference between the two classes, and if so, how 
big are the differences? 

In the enumerative problem, some action is to be taken because the 
frequency of some particular characteristic of the universe is found to 
exceed some critical value. The crop of wheat, according to a sample 
survey, turns out to be large or small. As a consequence, the market 
goes down or up, and production of meat, cereal products, and of sub- 
stitutes shifts one way or another because of this information. The 
Census, or perhaps a sample study of birth registrations, shows that 
in a particular city the number of children in the primary schools will 
be much greater in 4 years than now. Bonds are issued; work com- 
mences on a new school building. Inspection of a sample of wool or 
of cotton may determine its disposition, the price to be paid for it, 
and what kind of cloth and of garments to make of it. Inspection of 
a lot of industrial product determines whether it will be accepted or 
subjected to screening or to a lower rating, or outright rejection. 

Such problems are enumerative because they depend purely on a 
determination of the number of people in an area, or the inventory 
of grain, or the production of grain, or the quality of a product. They 
do not involve the analytic question of why all these people are there 
or why the crop this year is what it is; or why the wool or cotton or 
product is so good or so bad. 

When certain cities in America were swelled with in-migrants be- 
cause of war production in the spring of 1944, special censuses were 
taken with the aim of arriving at equitable allocations of food, gaso- 
line, repair parts for buses and trolley cars, and other necessaries of 
living. Equitable distribution of supplies to these cities was impossi- 
ble because no one knew just how many people were in them: assertions 
of editors and chambers of commerce did not provide a basis for ac- 
tion. The problem was enumerative because the action (viz., allocation 





1 These are the terms that I invented for Chapter 7 in my book Some Theory of Sampling (John 
Wiley, 1950). The terms are not important; the concepts are. The concepts are old, but plain statements 
of what they are and of the consequences of failing to keep them in mind in design and analysis are not 
easy to find. Similar but not exactly parallel concepts occur in the analysis of variance, under the terms 
Model I and Model II, a lucid explanation of which occurs in the paper by Churchill Eisenhart, “The 
assumptions underlying the analysis of variance,” Biometrics, 3 (1947), 1-21. 








246 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1953 


of food and materials) depended on how many people were there and 
not why they were there. 

By law, the Social Security program is partly an enumerative prob- 
lem because federal reimbursement to a state depends on the number 
of inhabitants 65 and over within the state. Public health programs, 
agricultural adjustments, and other allotments depend on population 
and acreage, and are examples of enumerative uses of data. Adminis- 
trative problems concerned with the long-range aims of these programs, 
however, are analytic. 

In the analytic problem, the action is to be directed at the underly- 
ing causes that have made the frequencies of the various classes of the 
population what they are, in order to govern the frequencies of these 
classes in time to come. Familiar examples of analytic studies are found 
in intelligent city-planning. More familiar studies are the differential 
effects of varieties and of treatments in agriculture and entomology. 
The particular crops that are measured are of interest only because 
they aid decisions on what varieties and treatments to use for the 
best results in crops yet to be planted. We may run an experiment 
with a group of test animals or with patients in a hospital, but when 
we generalize from these tests we are thinking of the production 
process: what will it produce in the future? The present tests are 
important only because they help us to prescribe or to modify the 
treatment for future use. The control-chart is a splendid example, the 
purpose being to control the production process and the quality of lots 
yet to be made. Other examples are medical and social studies wherein 
interest centers in the causes that produce differences in health, fer- 
tility, or death-rate in different segments of a population of people. 
Current population surveys in the United States and Canada aid stud- 
ies of employment, unemployment, farm and industrial labor, school 
attendance, etc. The monthly sample of deaths by causes, published 
by the National Office of Vital Statistics in the United States, aids in 
the control of epidemics and the spread of disease. Its use is both 
enumerative and analytic. 

Special reference to the statistical control of quality. Both the enumera- 
tive and analytic problems present themselves hourly in the statistical 
control of quality. A batch of product has been produced, let us sup- 
pose, and the machine is already producing another batch. Two ques- 
tions arise: B (analytic). Shall we leave the machine alone, or shall 
we adjust it? Shall we make it run slower or faster, or shall we change 
the type of chemical bath? A (enumerative). What shall we do with 














DISTINCTION BETWEEN ENUMERATIVE AND ANALYTIC SURVEYS 247 


the batch of product just made? Shall we send it on to the next opera- 
tion (which might be into the consumer’s market, or into the next bay 
of the same factory for further work)? Or is the product so defective 
that we must re-work it, sell it as second-class, or scrap it? 

A chemical engineer whose specialty is the production process may 
have a special interest in Problem B, and little in Problem A, which he 
leaves to someone else. On the other hand, if we are the purchaser of a 
batch of product, such as a single automobile, or some paint for our 
home, or a carpet, we certainly have a special interest in Problem A. 
We wish to know the quality of this particular batch of product. It is 
little comfort to know that the process by which it was made was a 
good one, and was in a fine state of statistical control, if the product 
that we ourselves purchase turns out to be defective and unsuited to 
our purpose. A manufacturer, on the other hand, must purchase raw 
materials and assemblies in quantities, week in and week out. In order 
to cut the costs of these materials and to improve their quality, he 
must concern himself not only with Problem A, the inspection of these 
materials upon receipt; he must in addition take a lively interest in 
Problem B, the control of the production processes in the plants of his 
suppliers. 

The methods of the Shewhart control chart are essentially analytic, 
as they tell when to take action on the process. In contrast, the meth- 
ods of acceptance sampling are primarily enumerative, dealing with 
the disposal of a lot, although they react secondarily on the process by 
forcing better control where needed. 


II. THE SAMPLING VARIANCES FOR THE TWO 
TYPES OF DISTRIBUTION 


The two uses re-stated in terms of sampling distributions. Re-stated in 
terms of a mechanism for carrying out the sampling, we may distin- 
guish between the two uses (enumerative and analytic) by considera- 
tion of the two distinct types of repetition of the operations that lead 
to two distinct sampling distributions. In the enumerative case, we 
take repeated random samples from the same lot, and seek the sam- 
pling distribution of the mean or of other statistical measures of these 
samples. In the analytic case, we take repeated random lots from a sup- 
ply or cause system, and we select a random sample from each lot; then 
seek the sampling distribution of the mean or of other statistical meas- 
ures of these samples. 

The use to which the data will be put determines which of the two 
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types of repetition is applicable in any one problem. Unfortunately, 
sometimes we require data from the same survey to serve both pur- 
poses. 

It is helpful to look at a diagram. The figure shows three bowls with 
poker chips, all physically similar, some red and some white. By stir- 
ring the contents of any bowl thoroughly, and reaching in blind-folded, 
it is possible to satisfy satisfactorily the conditions for a random sam- 
ple. Another way is to give serial numbers to the chips and to draw 
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them with random numbers. The bow] on the left represents the process 
or cause system. It is a supply of chips. The bowl in the middle repre- 
sents the lot. It is the people in an area today, or a batch of product, or 
a crop. The lot has come, we suppose here, as a random sample from 
the process. This assumption is over-simplified; nevertheless it is a 
first step to an understanding of analytic problems. The small bow] at 
the right represents a sample drawn from the lot. 

The four possible different variances. Now we are able to state the 
two problems in terms of estimation. In the enumerative problem the 
sample is used for an estimate of the contents of the lot, which is de- 
scribed by the proporiions P and Q. In the analytic problem the sam- 
ple is used for an estimate of the contents of the supply, which is de- 
scribed by the propc:tions p and g. The same sample serves both pur- 
poses, but not equally well, for the two estimates have different 
variances. Hence the proper size of sample, and how to interpret a 
sample, will depend on whether the aim is enumerative or analytic. 
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The variances of r/n as estimates of p and of P are shown in the ac- 
companying table. There are four cases, A, B, C, D, depending on how 
the lot and the sample are drawn.? There are two interpretations 
(enumerative and analytic) in each case. 


Table of the variances of D and of P 
In all cases, Ep=p and EP =P 



































The N balls in the lot- The sample of n is drawn from the lot-container 
container are drawn 
from the supply With replacement Without replacement 
Case A Case B 
~~ _— 1 ~~ 
Analytic Var p =?) 1 + Var p | 
n N n 
WitH REPLACEMENT 
Enumerative Var P ane Var P= > = 
n N-1 n 
Case C Case D 
. =~ n-1 M-N ~ M-—n pq 
(p=—i1l1+—_—_ ——_ = — 
Analytic Vari p a + VN Wei Varp ——< 
Wirth REPLACEMENT 
Enumerative Var P mo Var P= - PQ 
n N-1 n 
APPLICATIONS 


Tabulation plans. The two variances, Var p and Var P, are different. A 
sample therefore contains different amounts of information for the two 
purposes. How do these observations help us in the design of samples? 
They tell us that if there is a definite enumerative aim in finding out 
how many people there are with a given characteristic however rare, 
then the tabulation and printing of small cells may be justified, pro- 
vided the universe will still retain enough of its characteristics by the 
time the sample is tabulated. 

On the other hand, if the aim is analytic, there is the sampling error 
o> even if the figure comes from a complete census, and this error may 
become troublesome in small cells. It will then be well to economize by 





2 For the derivations of the variances see the author’s Some Theory of Sampling (John Wiley, 1950), 
p. 254, 
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using sampling in the collection of such data, and to determine in ad- 
vance what consolidations may be made in the tabulating and in the 
printing. Much space and money are wasted annually on the tabulation 
of cells that are too small for analytic use, and which have no enumera- 
tive use. Too often the excuse for tabulating small cells in a complete 
census is that they came from a complete census and must therefore 
be correct. When the use of such tables is analytic only, such arguments 
do not hold: a reconsideration is due. 

Case B is one that often corresponds approximately to many prob- 
lems in real life. For enumerative use we take the proportion r/n in the 
sample to estimate the proportion P in the lot. The variance of the es- 
timate 





m r 
ro (1) 
n 
is seen in the table to be 
. N-—-nP 
Var P = eS, (2) 
N-1 n 


which reduces to 0 if the sample n is increased to a complete census 
of the lot, when n=N. 

In contrast, for analytic purposes in Case B we use the proportion 
r/n in the sample to estimate the proportion p in the supply. The vari- 
ance of the estimate 


, r 
p=— (3) 
n 
is seen in the table to be 
Yerto =. (4) 
n 


The size N of the lots, although they furnished the samples, does not 
enter into this variance at all. The size N only limits the size of the 
sample: it cannot be bigger than the lot. To reach greater precision 
than pq/N (the variance of a complete census) we must draw another 
lot and sample it also; then combine the two estimates of p. 

Effectiveness of a medical treatment. We may see from the following ex- 
ample the two possible ways of interpreting data. Three hundred 
ninety-eight patients in a hospital went under treatment for a certain 
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disease. After two months a random sample of 250 patients drawn from 
the list of the original 398 showed no sign of recurrence. How do we 
figure the odds that there are 0, 1, 2, etc., recurrences in the remaining 
148 patients? Stated another way, what is the highest number of recur- 
rences in the original 398 that would permit such a result as often as 
once in 20 repetitions of the sample? This question is framed as an 
enumerative one. 

This sample was drawn without replacement, and we may see from 
either Case B or D that the probabilities to apply are the terms of the 
hypergeometric series. 

Let K be the number of patients in the original 398 who actually 
have a recurrence, none of which showed up in the sample of 250. Then 
the probability of the result obtained is given by the hypergeometric 


term 
Crcmo Co) 
250 0 
(0) 
250 


_ 398 — K 398-— K—1 298 —K-2 
398 397 396 








etc. to 250 fractions. (5) 


A few results are tabled here, whence we may conclude with odds of 
about 19:1 that there are not more than 3 recurrences in the lot. 











K=NP - P, 
0 0 1 
3 -008 0.051 
4 .010 .019 
5 .012 .007 
6 .015 -002 
7 .018 .001 





Now suppose that the problem is to predict the proportion that 
would be cured in a succession of lots of patients. This is an analytic 
question, and the theory to use here then is the binomial series. If p 
is the proportion of recurrences in the general population (from which 
by assumption now the 398 patients is a random sample), then the 
probability of the observed result is 
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Foe rr =e. (6) 


It is interesting to note that the size 398 of the lot does not come into 
this probability at all. 

A few results are in the table here, whence by interpolation we may 
conclude with odds of about 19:1 that the proportion p of recurrences 
in the general population would not be more than 12 in 1000. 











Pp Po 

0 1 
-005 . 286 
-010 .081 
-015 .023 
-020 -006 
.025 .0002 





This example, though oversimplified, may help to guide the design 
and interpretation of the results of samples. It shows specifically how 
the interpretation changes when we change our aims from the enumera- 
tive to the analytic use. 

Allocation of sample. In the symbolism just introduced, the analytic 
aim is to measure the difference between the two proportions p; and p» 
which exist in two cause systems, or to find out if there is any signifi- 
cant difference between p; and pe. We cannot examine the cause sys- 
tems directly; we can only study two groups of farms, plots, patients, 
or pupils that the two cause systems have produced. That is, we can 
only study two lots, one from one cause system, and one from another. 
We shall assume that Case B fits the actual events. 

The lots may be of different sizes, Ni and N2. At any rate, we take 
samples therefrom of sizes m; and m2, and we ask what should their 
sizes be to minimize Var (/2— 4), an analytic purpose. The optimum 
allocation for this purpose requires that 


n= ko, = kV/ pin (7) 
Ne = koe = kv/poq (8) 

where 
= n/(o1 + 02). (9) 


Now usually, if not always, such problems require the aid of statistical 
techniques only if p: and p2 are not far apart; if the difference between 
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them were wide, we could observe it without help. Hence we may prop- 
erly take 


nN = Ne 


as optimum. Thus, the usual practice of taking equal sizes of sample 
for clinical or laboratory tests is correct for minimizing the variance 
of the estimated difference, regardless of the sizes of the populations or 
of the acreages of the crops whence the samples were drawn. 

In contrast, for the enumerative purpose of estimating the over-all 
average £ or total X of some characteristic (average rent, total number 
of unemployed, total acreage in wheat) in the two lots of size N; and 
No, the aim is to minimize the variance of < or of ¥. an enumerative 
purpose. To this end, the optimum size of sample wiii be 


m = nN,/N : , 
__+ [for proportionate allocation | (10) 
nN, = nN. 2/ N 
or, if one prefers, 
n= kN 10; : 7 - 
\ [for disproportionate allocation | (11) 
ne = kN 202 


where 
k= n/(Nioi 4. N202). 


Obviously, the optimum allocations for the two purposes will be dif- 
ferent except when the two lots N; and N2 are nearly equal in size. 

Unfortunately, the purposes of a survey are often both analytic and 
enumerative. In a survey to assist the marketing of a certain brand of 
frozen orange juice, we need to know not only how many people of 
various income levels buy frozen orange juice of a particular brand, but 
of all brands, and tinned unfrozen juices as well, and probably fresh 
fruit besides. These are enumerative counts. Then also, probably more 
important, the survey must discover why people of various groups buy 
or do not buy frozen orange juice and the products that compete with 
it. This kind of question is analytic. The design that is economical for 
one type may not be economical for the other. 

In another study, a research worker wishes to study the variation in 
the behavior of people, classified by age, education, marital status; 
perhaps also by religion, urban and rural residence, and occupation. 
Frequencies are important; but so also are the contributing causes. 
Thus, this research presents also both enumerative and analytic prob- 
lems. 
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One way out is to conduct two surveys—one for enumerative pur- 
poses to ascertain the frequencies of certain behavior by class; an- 
other, with a more intensive questionnaire, to study the causes. 

Another solution is to make some sort of compromise in the design, 
sacrificing economy and precision in (e.g.) the enumerative results, in 
order to gain something for the analytic uses. How much to sacrifice, 
how far to lean, and which way, can be settled only with consideration 
of the risks and of the losses of making a wrong decision on the basis 
of information not sufficiently precise, and on consideration of the ad- 
ditional cost of getting more precise information. 

A note on acceptance sampling. The probabilities that one encoun- 
ters in the analytic problem in Case B justify the customary 3-sigma 
control limits in the form 


3R (Ris the average range over a series of samples, 
dz/n and R/dz2 is an estimate of c) 





Z+ 
for the Z-chart, or 


eT" pq (pis the average fraction defective over a series 

sie n of samples) 
for the p-chart, as an aid for detecting uncontrolled variability. It will 
be ebserved that the size N of the lot does not appear in these equations 
even if the sample-size n is 100 per cent of N. This form of computa- 
tion is now seen to be correct; it is not an approximation. The justifica- 
tion is the absence of N and of any finite multiplier in the analytic 
Case B. 

On the other hand, many writers in dealing with the producer’s risk 
in acceptance sampling (the probability that a lot of acceptable qual- 
ity. will be rejected) have recommended hypergeometric terms (like 
that in Eq. 5), or rather have reluctantly used binomial terms as ap- 
proximations to the hypergeometric terms. Actually, however, the pro- 
ducer is concerned with the problem of keeping his process in control 
and at a desired level p. The quality P of the lots that he produces will 
vary from lot to lot, yet the risk (probability) that a lot will be rejected 
on a single-sampling plan turns out to be a sum of binomial terms typi- 


fied by 
n 
( ) q”"p’; 
» 


into which the size N of the lot does not enter at all. This problem be- 
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haves like the analytic Case B, for which the binomial terms are cor- 
rect; they are not approximations. 

The consumer’s risk is another story. The consumer is concerned with 
the particular lots of product that he is purchasing, and he has a sam- 
ple from each lot on the basis of which to decide whether to accept or 
to reject the lot. He may aim to guard against accepting lots with too 
high a value of P, regardless of p and of the state of control. To com- 
pute on this basis the correct probabilities for the consumer’s risk, one 
requires hypergeometric terms. The finite multiplier (VN —n)/(N—1) 
then ‘appears in the variance of P, because this is the enumerative Case 
B. 

There are thus, strictly, two operating-characteristic (O.C.) curves, 
one for the producer, another for the consumer. In practice, however, 
except when the sample is 20% or more of the lot, the two curves co- 
incide, almost, fortunately, and one curve suffices. 

The distinction made here between the different probabilities for the 
producer’s and consumer’s risks is not new. It forms the basis for the 
Dodge-Romig tables, as is clear from their text. An extremely lucid ex- 
position appears in the book Sampling Inspection by the Statistical Re- 
search Group, Columbia University (McGraw-Hill, 1947), pages 183 
and 184. I quote: 

There are two alternative ways of interpreting “percentage of defective 
items in submitted product,” and these lead to somewhat different O.C. 
curves for small inspection lots. (a) The percentage of defective items can 
be considered as applying to each inspection lot separately. ... (b) The 
percentage of defective items can be considered as applying to the proc- 


ess... . If interpretation (b) is adopted, the resultant O.C. curve does not 
depend on the size of the inspection lot. 











CONFIDENCE INTERVALS FOR THE NUMBER SHOWING 
A CERTAIN CHARACTERISTIC IN A POPULATION 
WHEN SAMPLING IS WITHOUT REPLACEMENT 


Leo Katz 
Michigan State College 


I. SUMMARY 


N A number of different sampling situations, we encounter a problem 
| which may be stated as follows: From a population of N objects, 
each of which possesses or does not possess a certain characteristic, we 
select without replacement a random sample of n objects. Observing that 
m of these possess the characteristic in question, we wish to estimate the 
total number M having the property. This problem arises in sample 
surveys of human populations and in sampling inspection of manu- 
factured articles as well as in other, less obvious, situations. 

It is well known (see e.g., F. F. Stephan’s solution, quoted by Dem- 
ing [2, p. 294]) that the maximum likelihood estimate for M is the 
largest integer in (m/n)(N+1). The problem of confidence interval esti- 
mation for M, however, seems to have escaped systematic investigation 
in the literature. In this paper, the problem is considered and two ap- 
proximate methods are giv. . for construction of these confidence inter- 
vals. On the basis of some preliminary investigations, it appears that 
the second approximation, which allows for correction for discontinuity 
jn the observations, agrees quite closely with the exact computations. 


II. FORMULATION OF THE PROBLEM 


The roughly similar problem of confidence interval construction for 
the parameter 7 in an infinite population or where sampling is done 
with replacement has been exhaustively studied by Clopper and Pear- 
son [1]. When N is very large, confidence intervals for M = Nx may be 
obtained in terms of the well-known results for x. When, however, N is 
small or moderately large, this approximation is not useful and exact 
methods or better approximations are needed. 

Consider the statement of the problem as one involving a fourfold 
table. There are two groups of n and (N—n), respectively. We observe 
that m of the n possess a certain characteristic. Assuming the grouping 
to be arbitrary and random, we wish to estimate the total number, M, 
in the combined groups, possessing the characteristic. Accordingly, we 
have the fourfold table, 
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m n—m™ | n 
M—-m N-M-n+m N-n 
M N—-M N 








where m, n and N are known at the outset, and M is to be estimated. 
It is well known, due to a result of R. A. Fisher given by Yates [6], that 
the exact distribution of m is hypergeometric with probabilities given 


by 
n'i(N — n)!M\(N — M)! 
N!m!(n — m)\(M — m)\(N — M —-n+™m)! 





(1) P,{m| N,n; M} = 


where the semicolon before M emphasizes that N and nm are known 
numbers while M is a fixed but unknown parameter in our problem. 

The known exact distribution (1) defines implicitly confidence inter- 
vals for M in terms of the fixed values of n and N and the observed 
value of m. The explicit construction of confidence intervals requires 
the existence of tables of the distributions; it is at this point that the 
difficulties occur. Finney [3] gives percentage points of the distributions 
for n and (N—n) up to 15 each. The author [5], in unpublished tables, 
has computed the cumulative distribution functions for all combina- 
tions with N up to 24. Thus, except for extremely small numbers, the 
exact confidence intervals are not available. We turn, therefore, to ap- 
proximations. 

Chung and DeLury [7] have recently published a set of charts! giving 
90 per cent, 95 per cent and 99 per cent confidence intervals for N = 500, 
2500 and 10,000 with sampling rates of .05 and .1 by tenths to .9. 
Interpolation for other population sizes using 1/./N as argument is 
satisfactory ; interpolation for other sampling rates than those specified 
is inconvenient, because neighboring values are read from different 
charts. Extrapolation beyond N = 10,000 is reasonably accurate. 

This leaves a gap between about N=25 and N=500, not likely to 
de filled by exact computation. The approximation given here may be 
useful in bridging this gap. For example, it might be possible to obtain 
approximate hypersurfaces for upper and lower confidence limits by 
this means, and then to devise systematic corrections in the direction 
of relatively few exactly computed points. 





1 See notice in this Journal, 46 (1951), 394-5. 
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III. APPROXIMATE CONFIDENCE INTERVAL CONSTRUCTION 


In the paper by Yates [6] to which reference was made above, it was 
shown that the exact distribution of m may be approximated by a re- 
lated variable having the x? distribution with one degree of freedom. 
(x, then, is a unit normal variable.) Yates further points out that, for 
relatively small fourfold tables, the approximation is much improved 
by correction for continuity, i.e., for the replacement of the discrete 
hypergeometric variable by a continuous variable. x? in the fourfold 
table above is given by 


_ N(mN — Mn)? 
~ n(N — n)M(N — M) 





(2) x 


If, in (2), the value of x? at the 100a percentage point and the known 
values of the other quantities are inserted, the resulting expression is a 
quadratic equation in M. The discriminant of the quadratic form being 
positive except in the trivial case, n=N, the two real roots are easily 
interpreted. The smaller root, M,, is the value for which x(m) is at the 
upper 50a percentage point and, hence, m is approximately at the 
upper 50a percentage point of its (exact) hypergeometric distribution; 
a similar statement holds for M2. The interval, M;,SM<M,, is then a 
confidence interval for M with confidence coefficient (1—a). 

In what has been said immediately above, no allowance has been 
made for the continuity correction. Since, for the smaller root, m is in 
the upper tail of its distribution, the appropriate correction for con- 
tinuity (following Yates) is subtraction of 1/2; for the larger root, we 
use m+ 4. Thus, the confidence intervals for M, without and with cor- 
rection for continuity, are, respectively, 





[tet 
— 





~4/ te - = balm? + (n — m)?] + (2m — a 
(3) 


IIA 





musk + (2 ) 
—— 





2 
+A4/k.? — = balm? + (n — m)?] + (2m — > 


(uncorrected) 











CONFIDENCE INTERVALS WHEN SAMPLING IS WITHOUT REPLACEMENT 259 


and 





2ka 





~ 4/ ba? —— alm = 9)* + (n= m+ 4)! + Om — 0 — ma 
(4) 





N 
SsMsz |e + (2m — n+ 1) 





+ 4/ kat — kal(m + 9) + (n — m —})*] + (Qm—n+ a 


(corrected) 
where ka=n+(1—n/N)x,?. 


IV. SOME EXAMPLES 


The three examples given below are chosen for their dissimilarity; 
they are not intended to be comprehensive. 

(¢) In a very small sample inquiry, we ask nine persons, randomly 
selected from a group of 100, whether they are in favor of a certain 
proposal and we find three in favor, six opposed. We wish to construct 
a 95 per cent confidence interval for the number, M, in the whole group, 
in favor of the proposal. Using (a) equation (3), (b) equation (4), and 
(c) exact hypergeometric methods, we obtain the following confidence 
intervals: 


(a) 13 < M <€ 63 (uncorrected approximation) 
(b) 10 S$ M &S 67 (corrected approximation) 
(c) 9=M S&S 68 (exact) 


The interval (6) actually almost included the possibility of 68, since the 
cutoff value was 67.97. The Clopper and Pearson chart would indicate 
M between 7 and 72 inclusive. 

(it) This example is somewhat unrealistic and is chosen only to illus- 
trate the use of existing tables and to point up the desirability of ex- 
tending these tables. In this case, having observed that three out of 
eight randomly chosen from a group of 20 exhibit a characteristic, we 
seek a 90 per cent confidence interval for the number M, in the complete 
group of 20. Here, the x? approximation, corrected for continuity, gives 
3.023 <M <13.042. These values are in excellent agreement with the 


& 
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exact computation based on the following values from the author’s 
table [5]: 


P,{m = 3| M = 3} = .0491 
P,{m = 3| M = 4} = .1531 
P,{m < 3| M = 13} = .0521 
P,{m < 3| M = 14} = .0181 


Examination of these probabilities indicates that, on the one hand, 
M =3 is almost acceptable and, on the other, M = 13 is barely accepta- 
ble at the 90 per cent level. 

(zit) Grant [4, pp. 412-14] considers in some detail a series of ac- 
ceptance sampling plans in which 5 per cent samples are taken from 
lots of 2500 items. In this series of plans, the acceptance numbers are 
0, 1, 2, 3 and 4, respectively. We now wish to construct 95 per cent 
confidence intervals for the number of defectives in the lot of 2500 
when we observe 0, 1, 2, 3 or 4 defectives in the sample of 125. The 
table below gives confidence intervals obtained by the corrected x? ap- 
proximation. 


95 per cent confidence intervals for number of defectives 
in lot of 2500 when sample size is 125. 





Number of defectives Confidence interval 
in sample (inclusive) 





0- 89 
2-122 
8-152 
16-180 
27-207 


rwOWe © 








Similar computations were carried out in this case for the normal ap- 
proximation to the binomial approximation to the hypergeometric dis- 
tribution. In every instance, a somewhat wider confidence interval was 
obtained ; the results, however, were in excellent agreement in the sense 
that the intervals in the table above fell wholly within the second set. 

In the case of the last example, it was possible to compare the confi- 
dence intervals obtained by our procedure with those given by the 
Chung and DeLury charts.! For each of the five acceptance numbers 
above, the Chung and DeLury confidence interval partially overlaps 
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the interval given above. For the acceptance number 3, for example, 
their confidence interval is (12, 170) as against our (16, 180). Exact 
computation of the lower confidence limit places it at 14, however, 
while the exact upper limit is at 169. 
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THE UP-AND-DOWN METHOD WITH SMALL SAMPLES* 


K. A. Brown.ex, J. L. Hopass, Jr., anD Murray ROSENBLATT 
University of Chicago 


I, INTRODUCTION 


HE up-and-down method for estimating the 50 per cent response 

point of quantal data was originaliy devised for testing the sensi- 
tivity of explosives. Dixon and Mood [6] suggested that this method 
might have advantages in other fields of application. They gave ap- 
proximate maximum likelihood estimates for the parameters of the 
(normal) response curve, and approximate formulas for the standard 
errors of these estimates. They pointed out that, on the basis of formu- 
las for the asymptotic variance, the up-and-down method was 30 or 
40 per cent more efficient than the usual probit analysis method. 

In spite of this efficiency advantage, the up-and-down method does 
not seem to have been given much consideration in such fields as bio- 
assay [10, p. 909] or fatigue testing of metals. It has suffered from two 
restrictions which, taken together, have greatly limited its usefulness 
in many experimental situations: 


(a) The trials must be made sequentially ; before each trial is started, 


the response to the preceding trials must be known. 

(b) The efficiency advantage has not been explored for small sam- 
ples: Dixon and Mood express the fear (p. 112), that “meas- 
ures of reliability may well be very misleading if the sample size 
is less than forty or fifty.” 


Thus, the minimum duration of the experiment would have to be forty 
or fifty times the mean time required for response. This disadvantage 
would, in many cases, more than outweigh the efficiency advantage. 
The main content of this paper is a report on computations made 
to determine the actual performance of some estimates for the mean 
dosage parameter yu, based on up-and-down series of length 10 or less. 
It is found that the Dixon-Mood formula for the asymptotic variance 
is reasonably reliable even in samples as small as 5 to 10. Thus, re- 
striction (b) is not necessary. As a consequence of this fact, the design 
of up-and-down experiments may be altered, so that several inde- 
pendent series are run simultaneously, without serious loss of accuracy. 
In this way, restriction (a) can be considerably reduced. Finally, the 
possibility of conducting an experiment with several short series run in 





* This work was supported by the Office of Naval Research. 
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parallel introduces a flexibility into the design, which permits us to ' 
take advantage of special features of some experimental situations with | 
a still further increase in efficiency. 

We have not considered the problem of estimating the scale parame- 
ter a. The reason for this is partly that yu is usually the parameter of 
greater interest, but primarily that with small samples no estimate for 
o can be accurate enough to have much value. Even if » were known, 
and even if the trials are conducted at stimuli giving the most efficient 
estimation [9], over 200 trials would be required to estimate o within 
20 per cent with confidence of 95 per cent. Our experience is that in 
most experimental situations, the scale parameter is sufficiently stable 
that the experimenter can guess its value in advance from past ex- 
perience more accurately than he can estimate it from a small sample 
[2, p. 476]. Fortunately, our procedures require only that ¢ be known 
within rough limits, and the performance of the estimates for u» are not 
sensitive to errors in the guessed value of co. 


II. THE ESTIMATE i 


We assume that the stimulus scale has been so chosen that the re- 
sponse curve is an integrated normal curve with parameters yu and o. 
That is, the probability of response P to stimulus y is given by 


(y—n) lo 1 . 
P= f —— o* !*qe. 
—_ V/2r 


We fix a system of equally spaced stimuli, yo tid, 7=0, 1, 2, ---, and 
perform the first trial at stimulus yo. Each subsequent trial is performed 
at stimulus d units below or above that of the immediately preceding 
trial, according as the immediately preceding trial did or did not evoke 
a positive response. We may without loss of generality choose our scale 
so that o=1, and we do this in order to simplify notation. 

The series of stimuli used in an up-and-down experiment form a 
stochastic process, whose principal feature of interest is that the stimuli 
tend to have a distribution centered at yu. In a series of length n, a 
natural estimate for u is simply the arithmetic mean of the n stimuli 
used. In such a series, however, the first stimulus yo does not contain 
any “information” about yu, since it was chosen in advance by the ex- 
perimenter. On the other hand, the level of stimulus for the (n+-1)st 
trial is informative and is known, even though this trial is not per- 
formed. Let C denote the sum of the stimuli used in trials 2, 3, ---, 
n+1; we shall term this the score. As our first estimate for » we shall 
consider 7 =C/n. 
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The estimate 7 is not quite the same as that given by Dixon and 
Mood, but the two are easily seen to be equivalent asymptotically, so 
we may use for @ the asymptotic variance formula derived in [6]. This 
is o4?=2G?/n, where G is a function of d and yo>—y. Dixon and Mood 
give a graph of the function G(d, yo—); a few representative values 
are tabled below. 











d G(d, 0) G(d, d/2) 
0 0.886 0.886 
2/3 0.961 0.961 
1 1.004 1.004 
3/2 1.076 1.102 





For given d, the extremes of G will occur approximately at yo—u=0 
and at ye—yn=d/2. It appears from this table that for d$3/2, G does 
not sensibly depend on its second argument. This means that the 
asymptotic accuracy of the estimate does not depend much on the 
“phasing” of » with the system of stimuli. 

Dixon and Mood recommend that d be chosen approximately equal 
to o. The experimenter must guess « and then set d equal to his guessed 
value. The table of G shows that he may err considerably in his guess 
for o without much affecting the asymptotic accuracy of the estimate 
for yu. 

Our concern is with the actual accuracy of @ for small n. We shall 
gauge this by means of the error variance E'(ji—)?, which is the natural 
generalization of the variance when biased estimates are being con- 
sidered. E(uij—y)? depends on n, yo—u, and d. It may be computed re- 
cursively. Let C,(yo, u, d) denote the score obtained in an up-and-down 
series of length n, with step size d, started at yo. Let Po denote the 
probability of positive response at stimulus yo. We have 


Yo td + Cryo + d, uw, d) if the first trial fails 


C, ’ id - j i 
+1(Yo, #, @) * —d+C,(yo — d, u, d) if the first trial succeeds 


therefore 
E[Cr+:(yo, Ky, d) | = E[Ci(yo, B, d) | + PoE [Cn(yo — d, p, d) | 
+ (1 — Po)E[Cx(yo + d, », d)] 
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id and 
30 
: E[Cns1(yo, HM, d) = (n + 1)u]? 
1S 
rd = Ely +d —n+C,.(yo+ d, u, d) — nu)?(1 — Po) 
eS 
+ E[yo-—d—w+Curl(yo — d, u, d) — nu}*Po 
-_ = E[Ci(yo, M, d) — p]? + 2(yo — wu) { E[Cass(yo, H, d) 
— (n + 1)u] — E[Ci(yo, u, d) — 2)} 
+ 2d{(1 — Po)E[Cn(yo + 4, wu, d) — ny] 
— PoE[C.(yo — d, uw, d) — nu)} 
= + (1 — Po)E[C.(yo + d, wu, d) — np]? 
+ PoE[C,(yo — d, u, d) — np)?. 
-() 
TABLE 1 
- EG) -» AND EG@i-w)3, d=1 
he —_ : 
\ 
he \ yore 
0 5 1 1.5 2 2.5 3 3.5 4 oA? 
n 
al 
od 1 0 li? Rid .634 1.046 1.512 2.003 2.500 3.000 2.016 
1.000 . 867 635 .651 1.182 2.312 4.016 6.253 9.000 . 
‘SS 
te ‘~ 0 116 242 ~422 -701 1.079 1.525 2.007 2.501 1.008 
: . 567 .284 .582 .557 .697 1.245 2.354 4.034 6.258 ‘ 
al ‘ 0 072 .180 .331  .529  .792 1.136 1.553 2.017 _ 
477 .365  «.431«.331.—St'—«wSS)=s«CwWBBs«d1“W3B7 «2.449 4.0811 ‘ 
‘a 
“a ‘ 0 068 .142 .249 413 .632 .809 1.219 1.602 _ 
872 .320 .378 .325 .423 .536 .968 1.592 2.621 ‘ 
e- 
_ P 0 .047. 116 © .212— ss .337—S's«w506~=——«.732)—s«1.006—:1.319 om 
‘ 328 .284 .311 .284 .362 .428 .700 1.100 1.843 ‘ 
e 
, 0 047.097 .170 «=-«.282.-—Ss«iw432—é‘<‘GSC*é«‘«‘ sd: - 
256 .254 .279 .252 .300 .359 .553 .824 1.346 . 
9 0 034 .084 .153 .243 .366 .529 .728 .956 -_- 
249 .227 1.240 .228 .267 .302 .443 .657 1.042 ‘ 
ds 
P 0 035 .073 .128 .213 .326 .464 .633 .838 - 
218 .206 .220 .205 .231 .266 .375 .529 # .827 ‘ 
. 0 026 .065 .137  .189 .285 .412 .568 .746 on 
200 .187 .195 .194 211 .232 317 .449 .683 ' 
- 0 033 .059 .103  .170 277. .371 .507 ~~ .671 _ 
180 .175  .178 .171 .189 .220 .281  .379 .570 ‘ 
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TABLE 2 
E(#)-u AND E(ui—u)? d=2/3 

















Yoru 
0 2/3 4/3 2 8/3 - 
n 
; 0 337 .788 1.364 2.005 om 
444 +449 = -.768.-—S—«i1:.899 4.027 
‘ 0 268  .609 1.090 1.687 - 
336 377 .564 1.260 2.862 
. 0 215.491 ‘877 «1.398 sad 
296 .315 449 883 «1.993 
, 0 177 ~—-. 403 725 «1.162 - 
259 278 364 663 —«:1..420 
0 149.339 609 984 
5 234 244 308 515 «1.060 -370 
0 128 —-.290 522 845 
6 209 220 264 418 ‘816 -308 
0 ll 253 455 737 
7 191 197 233 347 652 - 264 
, 0 098 —-.228 401 651 sis 
174 180. 206 297 533 
0 087 —-.199 358 581 
8 161 164  .186 258 447 -205 
0 079 ~—-.180 323 525 
10 148 152 169 228 381 -185 








These formulas enable us to compute [E(%)—y] and E(i—,z)*. Begin- 
ning with n=1 we have 


E(@) — w = E[Ci(yo, u, d)] — wu = yo — w+ (1 — 2Po) 
and 
E(& — u)? = E[Ci(yo, u, d) — p}? 
= (yo — uw + d)*(1 — Po) + (yo — w — d)*Po. 
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TABLE 3 
E(i)—» AND E(i—y)*, d=3/2 











your 
0 3/2 3 9/2 6 ou 
n 
, 0 200 1.504 3.000 4.500 | ogg 
2.225 601 2.274 9.000 20.240 
. 0 150 854 2.252 3.750 ane 
863 864 ‘881 5.075 14.063 
: 3 105 604 «1.569 ~—S—- 3.001 = 
734 527 682 2.530 9.010 
, 0 080 456 1.203 2.302 - 
496 494 503. «1.625. ~Ss«#5 837 
0 064 366 965 ‘1.862 
445 371 449 1.120 3.582 -463 
0 054 305 805 1.554 
6 347 346 350 (867 2.546 -386 
‘ 0 046 262 690 1.333 - 
320 282 322 665  —«:1.988 
0 040 229 604 —«:1.167 
8 267 266 268 559 ~—«-1.506 290 
0 036 204 537 —«:1..037 
® 250 207 251 459 «1.229 257 
0 032 183 483 933 
10 216 216 217 404. —«-1..009 232 











From these the moments for n> 1 follow recursively by using the above 
formulae. 

Tables 1, 2, and 3 give E(fi)—y and E(i—yp)? for d=1, 2/3, and 3/2 
respectively, and for n=1(1)10. In Table 1, yo—u=0(1/2) 3, while in 
Tables 2 and 3, y¥o— 4 =0(d) 3d. For comparison we give also the asymp- 
totic variance of o42=2G%(d, d/4)/n. We see that in all tables, if 
Yo—#S2d, the actual E(fi— yp)? is less than o4?, but that as yo—yp in- 
creases beyond about (5/2)d, E(i—j)* rapidly increases, particularly 
for small d. This means that the asymptotic variance gives reliable or 
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even conservative estimates for the accuracy of #@ in small samples, 
provided the experimenter can manage to start the process within, say, 
two testing intervals, 2d, of the mean, uz. 


III. THE ESTIMATE yp’ 


In many experimental situations the experimenter will be able to 
guess the value of u to within 2 steps, where the step is the guessed 
value of oc. In these situations the estimate é is quite satisfactory, being 
both simple and accurate. But it is always possible that the guess will 
be badly in error, and we now present a modified up-and-down pro- 
cedure which, by sacrificing some efficiency when yo—y is small, pro- 
vides an estimate with a guaranteed accuracy independent of yo—u. 
It is clear that if we start an up-and-down series at too high a stimulus, 
we shall very probably obtain only positive responses until the stimulus 
level has been reduced to a value near u. Similarly, if yo is far below yu, 
this fact will probably reveal itself, through an initial run of negative 
responses. This suggests that we should interpret an initial run of re- 
sponses of the same sign as an indication that yo was badly chosen, and 
that these trials may well be ignored in estimating uy.’ 

We are thus led to formulate the following design. Choose n in ad- 
vance, and continue the experiment until there have been n—1 trials 
in addition to the trials in the initial run of constant sign. Thus, the 
total number N of trials is a random variable, N =n, with N =n only 
if the first and second trials give contrary responses. As score, we use 
C’, the sum of the stimuli on the last n—1 trials and on the (V+1)st 
trial (which is, of course, not performed). For our estimate, we use 
hp’ =C'/n. 

To illustrate, let d2=1, yo=0, n=3. Suppose that the first two trials 
failed, but the third and fourth trials succeeded. This means that the 
successive stimulus levels were 0, 1, 2, and 1, and if a fifth trial was to 
be performed, it would be at level 0. Here N=4, C’=2+1+0=3, 
p’ =3/3=1. 

Values of E(u’)—y and E(u’—,)? can be computed from those for 
E(i)—y and E(fi—y)?. We first compute the probability of each possi- 
ble initial run. Each such run leads to a specific starting value for a 
series whose moments can be obtained from tables of E(jfi)—y and 
E(ji—p)*. We give in Table 4 values of E(u')—yu and E(u’—p)? for 
d=2/3, 1, 3/2, yo=0(d) 3d, and n=5 and 10. In Table 5 we give the 
expected number of wasted observations, E(N)—n, for d=2/3, 1, 3/2 
and yo—n=0(d) 3d. 


1 The estimate recommended by Dixon and Mood also attempts to avoid using the results of ini- 
tial trials made far from yu, but in a way different from that proposed here. 
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An appropriate measure for the comparison of two methods of esti- 
mation is provided by the inverse of the sampling variance V of the 
statistic being estimated. We may thus, following Fisher [8], define a 
quantity I such that J[=1/V. In the case of the comparison of ex- 
periments involving different numbers of observations it is fairest to 
make the comparison on a per unit observation basis, so we use 
I'=I/n=1/nV. 

In choosing between the estimates f and yu’, several considerations 
should be kept in mind. If the initial guess for y is fairly accurate, i 
will have the smaller error variance. In addition, yu’ involves “wasted” 
observations. Comparing rE (# — yu)? with E(N) E(u’ —u)? for n=10, 7 is 
seen to be more efficient than yn’ when yp—p=0, d, or 2d for d=2/3, 1, 
and 3/2. On the other hand, u’ is more efficient for large values of 
|yo—u|. Furthermore, in using »’ we have a known upper bound on 
the error variance, depending on d but independent of yo—y. Also, the 
bias of x’ is bounded and is considerably smaller than that of @ for 
Yo—u=2d, and 3d. 

An examination of Tables 4 and 5 suggests a further modification of 
the sampling method which may have desirable features in some ex- 
perimental situations. We observe (a) that the smallest value of d gives 
































TABLE 4 
E(u’—u) AND E(u’—yz)?, n=5 AND 10 
You 
d n oa? 
0 d 2d 3d 
5 0 .158 . 267 .317 
. 267 317 .423 .513 
2/3 
10 0 .083 .141 -168 
.157 .167 .199 .224 .185 
5 0 .128 .187 .198 
.357 .410 485 .516 
1 
10 0 .065 .095 .101 
-191 .202 .222 . 230 . 202 
5 0 .074 .096 .097 
.463 .507 .547 .551 
3/2 
10 0 .037 .048 .049 
.238 . 240 .251 .252 .232 
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TABLE 5 


EXPECTED NUMBER OF OBSERVATIONS USED IN APPROACHING 
uw BUT NOT USED IN THE ESTIMATE OF ug, E(N)—n 











ve—s d=2/3 d=1 d=3/2 
0 0.276 0.162 0.067 
d 0.500 0.493 0.498 
2d 1.115 1.300 1.429 
3d 1.976 2.274 2.428 





the smallest values for E(uz’—,)? while (b) the largest value of d gives 
the smallest value of E(N), for a given value of yo—y. This is intui- 
tively to be expected: a few big steps will get the process into the 
neighborhood of u; but once there, small steps will keep the process 
closer to p. 

Now suppose we begin the process with big steps, but change to 
small steps with the first change of sign. This design may be expected 
to have the small E(N) characteristic of big steps, and also the small 
E(u'—,)? characteristic of small steps. As an illustration, suppose we 
begin at yo—u=4.5 with d=3/2, and at the first change of sign change 
to d=2/3. We shall have E(N) = 12.43, and a simple computation (us- 
ing graphical interpolation in Table 5) gives E(u’ — yn)? =0.22, approxi- 
mately. Thus E(N) E(u’—yu)?=2.73, approximately. Had we used 
d=3/2 throughout, we should have had E(N) E(u’ —)? = 12.43 0.252 
=3.13, while the use of d=2/3 throughout would give E(N) E(u’—y)? 
= 16.7 X0.224 =3.74, approximately. The change in step size has given 
a substantial reduction in E(N) E(p’—yp)?. 

An elaboration of this idea would involve a series of step sizes, de- 
creasing to zero. Such a model would have an asymptotic efficiency 
with E(N) E(u’—)?=1.57. In this connection, see [12]. 


IV. COMPARISON OF EFFICIENCIES OF UP-AND-DOWN METHOD 
WITH THAT OF PROBIT ANALYSIS 


A comparison of the efficiencies of the estimates f and yp’ with that 
of the standard probit analysis is of interest but rather difficult. In the 
case of f and y’ the accuracies depend on the choice of d, and for the 
probit method the accuracy of the estimate is markedly affected by the 
choice of the number of stimuli, their spacing, and the distribution of 
the trials among them. Also, the true error variance of the probit esti- 
mate does not seem to be easily obtainable. 

We can, however, make the comparison on the following basis. For 





me 
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the probit method, we shall use a very common design, namely five 
stimulus levels equally spaced at intervals d with equal numbers of 
trials at eack level. We shall assume that the stimulus interval d repre- 
sents the experimenter’s guess for o, and shall use this same value for 
the step-size in the up-and-down series. 

We shall gauge the accuracy of the probit estimate by the usual ap- 
proximate formula for the asymptotic variance oc,” (equation 3.6 of [7]): 





_ ~| 1 4 z° | 
te om d wi > wiz? — #2 >> w; 


where x; are the stimuli, w; their probit weights, m the number of trials 
at each stimulus, and =) -w;2;/) wi. 

In Table 6, column 3 gives 1/I’ for the probit design specified above 
for yo—u=0, d, 2d, and 3d and d=2/3, 1, and 3/2. From the formula 
given for o,? we calculated mo,? and then no,?, where n=5m=total 
number of observations. Columns 4 and 8 are obtained by abstracting 
the appropriate values of E(j—,)? from Tables 1, 2, and 3 and multi- 
plying by n=5 and 10. For columns 6 and 10, E(N) is obtained from 
Table 5 and E(u’—,)* from Table 4. Column 5 gives R, defined as 


ee (I’ for up-and-down estimate ;2) 
= (I’ for probit method) 





and column 11 gives R; defined similarly but for yu’. Thus nR, is the 
number of observations required with the probit method to give the 
same error variance as would be obtained from jf with n observations, 
and E(N)Rz, is the number of observations required with the probit 
method to give the same error variance as would be obtained from p’ 
with a number of observations whose expected value is E(N). 

All values of R; and R, exceed 1, indicating that in all the circum- 
stances considered the probit method requires more observations. When 
d=1 (which is the usual objective for d), the probit method requires 
at least 45 per cent more observations than yp’, and at least 55 per cent 
more observations than ji, for all values of yo— considered. 

It might be felt that the above comparison is unfair to the probit 
method, in that we have judged its accuracy by an asymptotic formula. 
The only evidence on this point available to us, however, tends to show 
that the contrary is true. Berkson [2] has reported on an extensive 
sampling experiment one of whose purposes was to evaluate the small- 
sample performance of the maximum likelihood estimate for the pa- 
rameters of the logistic function. This function is scarcely distinguish- 
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able from the integrated normal. For thirty trials evenly divided be- 
tween three equally spaced levels, in estimating the location parameter 
with the scale parameter fixed, he found that the actual error variance 
of the maximum likelihood estimate was some 10 per cent greater than 
predicted by the asymptotic formula. (It should, however, be remarked 
that Berkson was able to attain or even slightly better the asymptotic 
variance with a minimum chi-square estimate.) While we realize that 
Berkson’s results are not exactly comparable, they lead us to feel that 
the efficiency advantage shown fo: the up-and-down method in Table 6 
is conservative. Further information may be forthcoming with the pub- 
lication of [3]. 























TABLE 6 
COMPARISON OF EFFICIENCIES OF 4 AND yp’ WITF THAT OF A PROBIT ANALYSIS 
n=5 n=10 
d | ye-u| ney? 

nE(i@—u)? Ri E(N)E(y’—p)? Rs |nE(i@—p)? Ri E(N)E(u'—y)? Rs 
2/3 0 2.11 3.39 1.81 1.41 1.50 1.48 1.43 1.61 1.31 
d 2.68 1.22 2.19 1.74 1.54 1.52 1.76 1.75 1.53 
2d 5.97 1.54 3.88 2.59 2.31 1.69 3.53 2.21 2.70 
3d 23.6 2.58 9.17 3.58 6.60 2.28 10.4 2.68 8.80 
1 0 2.82 1.64 1.72 1.84 1.53 1.80 1.56 1.94 1.45 
d 3.13 1.56 2.01 2.25 1.39 1.78 1.76 2.12 1.48 
2d 6.98 1.81 3.86 3.06 2.29 1.89 3.70 2.51 2.78 
3d 63.0 3.50 18.0 3.75 16.8 2.81 22.4 2.82 22.3 
3/2 0 4.15 2.22 1.87 2.62 1.58 2.16 1.92 2.54 1.64 
d 4.21 1.86 2.27 2.79 1.51 2.16 1.95 2.52 1.67 
2d wien 2.24 3.43 3.52 2.19 2.17 3.55 2.87 2.69 

3d [414 5.60 74.0 4.09 101 4.04 103 3.13 132 

















We remark that the sense of the comparison of Table 6 is not de- 
pendent on the particular probit design we chose. A probit design which 
concentrated the trials more tightly would gain in efficiency somewhat 
for yo near yu, but would pay for this by blowing up more quickly as the 
initial guess became bad; and conversely, a looser distribution of the 
trials would give fair performance ever a wide range of yo values, but 
would be even less efficient at yo near p. 


V. DESIGNS WHICH REDUCE THE LENGTH OF THE EXPERIMENT 


The necessity of performing the trials sequentially is the principal 
objection to the use of the up-and-down method in many quantal re- 
sponse experiments. The total time required is likely to be prohibitive 
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unless the response to the stimulus is immediate, as for example in 
explosives sensitivity testing.” 

We shall now investigate the possibility of mitigating this disad- 
vantage by running several short up-and-down series simultaneously, 
rather than a single long series. Let there be k series, each of length n, 
with m=kn the total number of trials. We shall conduct the series 
independently of each other, and use the same initial stimulus yo and 
the same step size d for all k series. From the ith series we form the 
estimate y;’ as in section 3, and for our over-all estimate use 


k 
lina ( - u') / k. 
i=l 

Clearly o;?=0,?/k, while E(z) = E(u’). 

Thus 
E(a — )? = 072 + (E@) — }? 
= E(u! — u)?/k + [E(u’) — w]*(k — 1)/k. 

The error variance of 7 may be obtained at once from Table 4 for 
d=2/3, 1, 3/2, n=5 and 10. 

To illustrate, consider four independent series each of length ten 
(k=4, n=10, m=40). The following table gives the error variance cor- 


responding to several value of yo—y, where we have taken d=1. We 
also give the values of R.=(400,?)/[4E(N)E(a—1u)’]. 











Yoru E(u — 4)? R, 
0 0.048 1.45 
1 0.054 1.39 
2 0.062 2.48 
3 0.065 19.7 
4 0.066 _ 








? It should not, however, be thought that the total time required to run an up-and-down series is 
n times as great as the time required to conduct a probit experiment with n trials. In the first place, the 
greater efficiency of the up-and-down method means that fewer than n trials are needed with it: as is 
explained in Section 4, something like 2n/3 trials will give the same accuracy as is obtained from a 
probit analysis based on n trials. Secondly, in many experiments, particularly in biological ones, the 
response time is not fixed but variable. The time zequired for the probit experiment is thus the mazi- 
mum of n response times while that required for the up-and-down method is the sum of 2n/3 response 
times. To illustrate, if response times have a chi-square distribution with six degrees of freedom (this 
distribution fits fairly well a series of response times published by Perry [11, p. 40]) and if n =30, the 
median time required to complete the up-and-down experiment will be 7.6 times as long as that for the 
probit experiment, rather than 30 times as long as one might at first think. 
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For a single series of length 40, the variance as given by the asymptotic 
formula is 1.008/20 =0.050. It appears that there is no appreciable loss 
of efficiency in breaking a series of 40 into four series of 10 provided 
that a good guess for u is made. We are also assured of a low bound on 
the error variance even if the guess is bad. For all values of yo— con- 
sidered, the ratio R, is at least 1.30. This indicates that a 5-level probit 
design would require at least 30 per cent more observations to yield an 
estimate of the accuracy of @, even for the most favorable choice of yp. 


VI, COMPARATIVE EXPERIMENTS 


In much work of the type considered in this paper, we do not make an 
absolute determination but rather a comparison of an unknown with 
some standard. What is desired in these situations is an estimate for 
the difference between the values of yu associated with two treatments. 
For example, in bioassay we may wish to estimate the amount by 
which the LD 50, mw, of a new drug falls below that, u2, of a standard 
drug. It is clear that the up-and-down method may be adapted to this 
kind of problem. We can simply run two series, one for each drug, and 
take fi;—ji2 as our estimate for u4i—2. The possibility of running sev- 
eral short up-and-down series opens up additional ways of economizing 
on sample sizes. 

It is well known that the response of an animal to a given dose is 
affected by (amongst other things) its weight, age, and previous history. 
The usual method of avoiding these disturbances is to select animals as 
homogeneous as possible, or, in the case of weight, to make the log 
dose simply proportional to the log weight. While in some cases this 
procedure will achieve a satisfactory compensation for the varying ani- 
mal weights there are others where the relationship is either unknown 
or less simple. It is not safe to assume simple proportionality with a 
regression coefficient of unity, as regression coefficients of log dose on 
log weight as diverse as 0.573 [5] and 1.511 [4] have been reported. 

In an orthodox probit comparative experiment, stratification with 
respect to some concomitant variable, such as animal weight, is possi- 
ble, but it is not practical to take account of it in the analysis. In an 
up-and-down comparative experiment, however, the whole series of 
say 80 animals can be broken up on the basis of weight into say 4 
groups of 20, and 10 animals from each such group allocated at random 
to the standard and 10 to the unknown. This would give us more 
homogeneous material and a correspondingly reduced standard error. 

Due to a paucity of suitable data in the literature, it is difficult to 
predict what magnitude of gain may be expected through this sub- 
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division of the material. However, as an example we may take Behren’s 
data [1] on digitalis in frogs. For February 1928 he measured the fatal 
dose of digitalis for 38 frogs varying in weight from 21.8 to 37.5 grams, 
and as usual expressed the dose as a fraction of the frogs’ weight. How- 
ever, a plot of dose, so defined, against frogs’ weight discloses that the 
heavier frogs require a smaller dose. If we rank the frogs in order of 
increasing weight and break up the group of 38 into 4 groups of 8, 10, 
10, and 10, then the variance of dose in his units within these groups is 
0.0035550 compared with an over-all variance of 0.0046134. The ratio 
of these variances is 0.771: thus 7.7 animals when used in the smaller 
groups will give the same information as 10 animals in the complete 
group. 

In the case of relatively large experiments, of course, the possibility 
exists of obtaining still greater homogeneity by classifying the animals 
into groups on the basis of more than one parameter. 

A further possibility is the re-use of survivors from previous experi- 
ments. In general, we would expect the tolerance to change, and the 
inclusion of these animals in a comparative probit experiment would 
increase the heterogeneity of the material. However, in an up-and-down 
comparative experiment broken up into small groups, the survivors 
from earlier experiments could form some of the groups. This procedure 
if used with discretion should allow a saving of up to 50 per cent in the 
usage of animals with the up-and-down method. There is also, of 
course, a furthey duction compared with probit experiments due to 
the relatively greater efficiency of the up-and-down method. 


VII. CONCLUDING REMARKS 


In choosing between the standard probit method and the up-and- 
down method for estimating », the following points may be kept in 
mind. 

(1) The probit method will require substantially more observations. 
It shows up best, relatively, when the guess for pu is accurate, but even 
here (using a standard 5-level probit design with stimulus interval 
equal to one standard deviation) the probit method requires at least 
50 per cent more trials than does the up-and-down estimate ji in series 
of length 10. The comparison becomes rapidly less favorable to the 
probit method if the initial guess is not accurate. When the central 
stimulus is off-center by two standard deviations, the probit method 
requires approximately 3 times as many observations as ... 

(2) With the up-and-down method applied sequentially, it is possible 
to obtain an estimate uw’ with a guaranteed accuracy, regardless of 
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initial guess for » (but depending, as does the probit method, on the 
guess for c). This estimate, while less efficient than @ for small yo—y, is 
still more efficient than the probit method, which requires at least 40 
per cent more trials on the average to give the same accuracy, when the 
guess for o is correct. 

(3) The small-sample performance of the up-and-down estimates is 
known, whereas that of the probit method is unknown and difficult to 
compute. The above comparisons are made on the basis of the asymp- 
totic variance formula for the probit method, which is probably opti- 
mistic. 

(4) The up-and-down estimates are easy to compute arithmetically. 
The probit estimate must be computed either by a troublesome itera- 
tive method, or by a graphical method involving the judgement (and 
hence the bias) of the computer. 

(5) The up-and-down estimates always exist, whereas the probit esti- 
mate may not exist. In small samples with y—4wu large, there is a sub- 
stantial probability that the probit estimate will not exist. 

(6) The probit method has the advantage that all trials may be per- 
formed simultaneously, whereas the up-and-down method requires that 
trials be made sequentially. We may, however, run the up-and-down 
trials in a number of parallel short series, without much loss of accuracy. 
In a typical situation, the up-and-down experiment may take 2 to 5 
times as long to complete as the probit experiment. 

(7) The up-and-down method, when arranged in short series in an 
experiment to compare treatments, can take advantage of any classifi- 
cation of experimental material designed to reduce the variance. This 
may lead to a further substantial increase in efficiency relative to the 
probit method. 

In summary, we believe that the up-and-down method will prove 
superior to the probit method in any situation wherein the arrangement 
of the trials in series is not prohibited by experimental or cost considera- 
tions. Certainly there are many laboratories now using the probit 
method which would profit from a change to the up-and-down design. 

Our thanks are due to Mr. J. McKibben for much of the computation 
invIuded in this paper. 
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DESIGNING SINGLE-SAMPLING INSPECTION PLANS 
WHEN THE SAMPLE SIZE IS FIXED 


ABRAHAM GOLUB* 
Ballistic Research Laboratories, Aberdeen Proving Ground 


A simple technique is developed for determining “best” sin- 
gle sampling inspection plans when the sample size is fixed. 
Two cases are considered: (1) single sampling plans for placing 
a lot into one of two categories and (2) single sampling plans 
for placing a lot into one of three categories. For case (1), the 
“best” plans are based upon a criterion of minimizing the sums 
of the producer’s and consumer’s risks. A similar criterion is 
employed as a basis for choosing “best” plans for case (2). The 
more general result for m categories is given also. Tables 1 
through 8 are presented to enable the user to choose the ap- 
propriate sampling plans. 


INTRODUCTION AND SUMMARY 


HERE have been various approaches to the problem of designing 

single-sampling plans for acceptance sampling by attributes. It is 
generally accepted, however, that the primary objectives are twofold: 
(1) to accept at least a stated high proportion, say 1—a, of lots having 
true fraction defective not exceeding a given acceptable quality, say 
pi and (2) to accept not more than a stated small proportion, say 6, of 
lots having true fraction defective not less than a given objectionable 
quality, say pe. The quantities a and 8 are commonly known as the 
producer’s and consumer’s risks, respectively. The problem of selecting 
a single sampling plan is one of determining the parameter n (sample 
size) and the parameter c (acceptance number) which will simultane- 
ously attain both of the aforementioned objectives. Grubbs has solved 
this problem and by using his tables it is quite simple to choose the 
single sampling plan which gives the desired protection.! 

There are many occasions in practice where for economic, adminis- 
trative, or practical reasons n is fixed and small (less than 50). For 
these cases, the attainment of the desired protection is usually impossi- 
ble. A plan could be devised in which a or @ is controlled; however, this 
may unnecessarily penalize either the consumer or producer. With n 
fixed, therefore, the problem is one of choosing a “best” compromise 
plan based upon some criterion. This paper proposes a solution of this 
problem. 





* The author wishes to express his appreciation to Dr. F. E. Grubbs for reviewing this paper and 
for his continued interest and stimulating comments throughout its preparation. 

1 Grubbs was not necessarily the first to consider this problem; his tables, however, give his solu- 
tion important advantages. See Frank E. Grubbs, “On designing single-sampling inspection plans,” 
Annals of Mathematical Statistics, 20 (1949), 242-56. 
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The criterion which is employed as a basis for a “best” plan is that 
the sum of the probabilities of accepting lots of true quality p: (accept- 
able quality) and rejecting lots of true quality p2 (objectionable qual- 
ity) be a maximum. Mathematically, the expression which is to be 
maximized may be written as 


(1) P = Pr»,(A) + Pr»,(R) 


where Pr,,(A) represents the probability of accepting lots of true qual- 
ity =p: and Pr,,(R) represents the probability of rejecting lots of true 
quality = pe. It is noted that in employing this criterion a and 6 may 
vary, whereas in the basic problem a and 8 are set values. For fixed n, 
the value of c which maximizes (1) is the integer nearest to 








1 4 n 
c= = =< ’ 
2 Pe 
(2) — 
1 44 
qi 
log — 
q2 


where q; =1--p: and g.=1—pz. 

This paper also considers the problem of obtaining single sampling 
plans involving two acceptance parameters, c; and ¢2, for the purpose 
of placing a lot into one of three categories, say 1, 2, or 3. Three quality 
levels are set; p: for category 1, p2 for category 2, and ps; for category 3. 
In this case a similar criterion for a “best” plan is employed. Mathe- 
matically the expression which is to be maximized may be written as 


(3) P = Pry,(1) + Prp,(2) + Prp,(3) 


where Pr,,(1) represents the probability of placing lots of quality p: 
into category 1, Prz,(2) represents the probability of placing lots of 
quality pe into category 2, and Pr,,(3) represents the probability of 
placing lots of quality ps into category 3. This criterion obviously 
minimizes the sum of the probabilities of misclassification. 

The solutions for the c; and cz which maximize (3) are the integers 
nearest to 








1 4 n 
a=-— 
1 2 De 
(4) log — 
Pi 44 
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1 4 n 
Q=-— 

2 2 Ds 

(5) ~~. 
a 

q2 

log — 

q3 


For the general case of classification into m categories, the solution 
for the c; are 











1 4 n 
ee SS 
2 Pi i<j 
6) log 3 
eel ae 
qi 
log 
Qi+1 


The tables at the end of this paper are sampling tables which enable 
the user to select the proper acceptance numbers for single sampling, 
based upon the solution developed herein.? The sample size, n, ranges 
from five through forty in intervals of five. The lot qualities range from 
.01 to .20 for acceptable qualities and up to .40 for objectionable quali- 
ties. Lot qualities are tabulated in intervals of .01. 

Examination of formulas (2), (4) and (5) indicates that these same 
tables can be used also to choose c; and c2 in obtaining solutions for 
classification into three categories. This is accomplished by repeated 
entry into the tables, first with p; and pz to obtain c; and then with pz 
and p; to obtain cz. In a similar manner, the c; for the general case of 
m categories can be obtained from the tables. 

The solutions developed in this paper are based upon the assumption 
of a Binomial Population. 

In summary, tables are provided which enable the user to determine 
quickly the acceptance numbers, c, for fixed n, so that the sum of the 
probabilities of misgrading or misclassifying is a minimum. For those 
cases where n exceeds the range of the tables, a simple computation 
employing formula (2) yields the desired results. 





2 These tables originally appeared in this author’s “Acceptance Numbers for Placing a Lot From 
Which a Single Sample is Drawr. into One of Three Grades,” Memorandum Report No. 581, Ballistic 
Research Laboratories, Aberdeen Proving Grounds, Md. (Nov. 1951). 
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EXAMPLES 
I, Given: 

n = 40 

pr = .Ol 

Po = .08. 


Find c so that a+ is a minimum. 

The last table yields c=1 when entered with p,=.01 and p.=.08. 
Thus the single sampling plan which minimizes a+8 is n=40; c=1. 
Specifically, a=.06; 8=.16 and a+8=.22. 


II. Given: 
n = 25 
a = O01 
DP = .10 
Ds: = 30. 


Find ¢c; and cz so that Pr,,(1)-+Pr,,(2)+Pr,,(3) is a maximum. 

The fifth table yields c.=0 when entered with p,;=.01 and p.=.10 
and also c.=4 when entered with po=.10 and p3=.30. Specifically, 
Pry(1) =.778; Pr 2(2) =.830; and Pr,3(3) =.910 and their sum is 2.518. 


THEORETICAL BASIS FOR CONSTRUCTION OF TABLES 1 THROUGH 8 


Assuming an acceptable quality level of p; and an objectionable qual- 
ity level of p2(pi1<p2) and the sample size n fixed, we can write 


: n! ae 
(7) Pr,,(A) = 2) ——— niin 
imoetl(n va t)! 


c n! 
8 oi inns 
(8) rahe . > t\(n — 2)! 





where, assuming a binomial population, Pr,,(A) represents the prob- 
ability of accepting lots of true quality p: and Pr,,(R) represents the 
probability of rejecting lots of true quality pz. The criterion for a “best” 
plan, as outlined in the introduction, is that 


(9) P = Pr,,(A) + Pr>,(R) = maximum. 
For c to be the acceptance number that maximizes (9), provided that 
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(9) has but one maximum, the following functional relationships must 
hold: 


(10) P,-—- Pi>0O0 


where the subscript denotes the acceptance number. Expanding (10) 
and (11) yields 


n! 
(12) ————. (piqi** — pra°g:*"*) > 0 
cl(n — c)! 


n! 
13 ctly.n—(e+1) — et+1y.n—(c+1) a 0. 
om (e+ 1)!(n —c — 1)! (ria ills 


Dividing both sides of the inequality (12) by the positive quantity 
n\/[cl(n—c)!] gives 





(14) D1°qi"-* — p2°q2"-* > O 
which is equivalent to 
2P192\° 
(15) (*) qi” > 92”. 
MP: 


Since (p192/q1p2) is less than unity, taking logarithms of both sides of 
(15) shows that c is an integer satisfying 


n 


log ee 
(16) Pi 


log = 


q2 


e < 








+1 


Similarly (13) can be reduced to show that c is an integer satisfying 


n 
c> — 1, 
toe = 
(17) P1 


log = 


Q2 








+1 


Combining (16) and (17), we find that c is an integer which satisfies 
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n n 
-l<c< 

P2 P2 

18) log — log — 
( pr 41 Pr 41 

q1 q1 

log — log — 

2 q2 


From (18) it is obvious that c may be obtained by evaluating 


n 


log 5 


(19) A 
log = 
q2 





i 
2 





+1 


and rounding to the nearest integer. If the value of (19) falls exactly 
midway between two integers then equal maxima exist at those two 
integers. 

An alternate and interesting approach for obtaining the acceptance 
number c which maximizes (9) is to approximate (9) as follows: 


P™& n! 0 nt n! 1 nl 
a” tune 


e+1/2 n! 
fp f ccmenisnniainaniaimmenais 217q1"-*dx 
j—1/2 xi(n = x)! 





(20) 
n! n! 
+i« (<— p2°q2" + Ilnm—D! wrer* + --- 
cH/2 on} 
"Su Haat ra-*iz) 


where j may be set arbitrarily equal to 0, or 1, or 2, ---, but at some 
value less than c. The range of integration for a particular term, say the 
(k+1)st goes from k—} to k+4, thus employing a useful device for 
passing from a discrete variable to a continuous variable. Differentiating 
(20) with respect to c and equating the result to zero (the condition for 
& maximum) gives 
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(21) 
n! 


(+ O)Mn —c — 9)! 


Taking logarithms of both expressions in (21) and solving for c yields 





poet 2qan—e- U2 = 0. 


n 





an 
c= —— 

2 P2 

log — 

Pi 


q1 
log — 
q2 


+1 


the same expression as (19), which had been derived exactly. Thus the 
value of c which maximizes the approximation (20), when rounded, is 
equal to the c value which maximizes (9). This justifies the use of the 
approximation and provides an efficient means of dealing with certain 
optimum problems involving the binomial distribution. 

The tables in this paper are based upon the solution developed herein. 


SINGLE SAMPLING PLANS FOR PLACING A LOT INTO ONE OF 
THREE CATEGORIES 


Given quality levels p1, p2, and p3(p1<p2<ps3) for cavegory 1, cate- 
gory 2, and category 3, respectively, with fixed sample size n, we can 
write 


e1 


(22) Pr» (1) = 


ino t!(n — 2) 


"had ll 


n! ! 


c2 cj 
23) Pr,,(2) = > —— a on 
(23) Pro(@) = 2 Tap Pe — 2 Ge aa 


2 n! 
(24) Pr,(3) =1— do nw i 

imo t!(n — 1)! 
where Pr,,(1) represents the probability of placing lots of quality p: 
into category 1, Pr,,(2) represents the probability of placing lots into 
category 2, and Pr,,(3) represents the probability of placing lots of 
quality ps; into category 3. 

Using the criterion discussed in the introductory section, it is desired 

to maximize the following expression: 
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(25) P= Pr»,(1) + Pr >, (2) + Pr»,(3). 


It is evident that, for any fixed value of ce, the solution for c; which 
maximizes (25) is equivalent to the one developed in the preceding 
section. Similarly, for any fixed value of c; the solution for cz is equiva- 
lent also to the solution developed in the preceding section. Thus the 
values of c; and cz which maximize (25) are simply 
1 + n q 1 4 n 
qo=-— and «= —— 
2 De 2 Da 
log — log — 
ee ae. 
71 qe 
log — log — 
q2 qs 








where c; and cz are rounded to the nearest integer. 

Following this line of reasoning, it is obvious that the preceding re- 
sults can be applied to the general case of m categories. That is, if 
Pi, P2,*** Pm are the respective lot qualities of categories 1, 2, 3, 
-++,m, then the c;,(¢=1, 2, 3, - - - , m—1) which maximize 


(26) P = > Pr(t) 
i=l 
are given simply as 
n 


Pitt 
log 
Di 


qi 
log 
Qi+1 


It is obvious from (27) that the tables can be used to obtain any c; by 
entering them with p; and pi41. 
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ON ERRORS IN MATRIX INVERSION 


Pau. S. DwrEer 
University of Michigan 
AND 
FREDERICK V. WauGH 
Bureau of Agricultural Economics 


I. INTRODUCTION 


ATHEMATICAL statisticians have given a great deal of attention to 

sampling errors, much less to errors of computation, and very 
little to errors due to faulty data. Before the statistician has confidence 
in his results, he should explore all three kinds of errors. 

This paper does not consider errors of sampling, since much has been 
written on that subject. A section is devoted to the discovery of the © 
amount of error resulting from the use of approximate computational 
methods, but the paper is devoted primarily to a consideration of the 
effects of errors in the original data—and specifically to the size of the 
maximum error in a particular element of an inverse matrix due to the 
inherent errors in the elements of the original matrix. 

The computation of an inverse matrix is the key to such important 
statistical problems as multiple regression, factor analysis, and the 
Leontief technique of “inter-industry analysis.” The methods used in 
this paper might be applicable to related problems, such as finding 
bounds to the errors in solving simultaneous equations, but it seemed 
best to limit this paper to errors in elements of the inverse matrix. 

Perhaps a few words should be said here about the nature of inherent 
errors (or inaccuracies) in the original data. Seldom can a statistician 
get absolutely accurate data in any field of work. Even in physical ex- 
periments, the data are at least subject to errors of measurement. In 
economics, the statistician must work with estimates of such variables 
as crop yields, price indices, and national income. In all such cases, it 
is extremely important that the statistician have information concern- 
ing the reliability of the data he uses. Unfortunately few statistical 
agencies of the Government, or of the trade, provide such information. 
This situation should be rectified to whatever extent is possible. In any 
case, a statistician should always find out as much as he can about the 
data he uses, and should form at least a rough judgment of the maxi- 
mum errors associated with each series. Without such judgment, he 
cannot be sure that his analysis is valid. 

A recent book [5] and paper [4] by one of the authors give consider- 
able attention to problems related to the subject of the present paper. 
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A formula for the first order discrepancies in the elements of the inverse 
matrix due to inherent errors of the data is presented [5, p. 286]. The 
present paper extends these results 

(a) by presenting methods for establishing a bound for the total dis- 
crepancy, ; 

(b) by showing how the actual extreme discrepancy for each element 
of the inverse may be determined and calculated. 

The results are applicable to the problem in which the elements of 
the matrix to be inverted are subject to unspecified but limited errors. 
Thus each element of the matrix is composed of a known part, a;;, and 
an unknown part, ¢;;, with the specification that | e:5| Sni;. The 7; are 
the bounds for the errors of the elements. 

There are several problems which are closely related to the problem 
of the errors of the inverse matrix. These are 

(a) the problem of the bounds and extreme values of the determi- 
nant of the matrix A, 

(b) the problem of the bounds and extreme values of the roots of 
equations which feature A-, 

(c) the problem of adjusted equations in which the values of the ¢;; 
are known. 

The amount of material we found which seemed necessary to an 
adequate discussion of the topic of errors of matrix inversion was so 
extensive that we decided to treat this specific problem in this paper. 
The other problems suggested above, though closely related to the prob- 
lem of the errors of matrix inversion, are not treated here except as 
they contribute to the subject of this paper, though several references 
to important papers dealing with these other problems are given. 


II. AN EXAMPLE TO BE ANALYZED 


An illustration is next presented in order to clarify the nature of our 
problem. For this purpose, we have chosen a problem of economics and 
statistics which was recently discussed by one of the authors [26]. The 
problem was to determine the least expensive combination of feeds for 
dairy cows which would meet, or surpass, certain stated requirements. 
This is a practical application of linear programming. The study con- 
cluded that a combination of milo, middlings, gluten, and bran was the 
least expensive dairy ration meeting the requirements for total digesti- 
ble nutrients, digestible protein, calcium, phosphorous, and _ total 
weight. The analysis of this problem involves the computation of an 
inverse matrix. We wish to determine the maximum possible error in 
the elements of this inverse matrix. 
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The essential data are given in Table I. 








TABLE I 
The original matrix 0.495 0.275 0.098 0.459 
0.436 0.449 0.264 0.410 
A 0.395 0.557 1.319 0.385 Absolute 
0.423 0.436 0.467 0.467 row-sums 
Maximum inherent errors, 0.0010 0.0005 0.0005 0.0010 0.0030 
0.0010 0.0010 0.0005 0.0010 0.0035 
0.0010 0.0010 0.0015 0.0010 0.0045 
0.0010 0.0010 0.0010 0.0010 0.0040 
Computed inverse, 6.483 5.153 3.311 —13.626 28.573 
—4.798 6.485 —0.841 —0.285 12.409 
Cc 0.694 —1.627 1.084 —0.147 3.552 
—2.086 —9.095 —3.299 14.896 29.376 
Absolute column-sums 14.061 22.360 8.535 28.954 73.910 





The first section of Table I exhibits the original matrix. The elements, 
a;;, represent the proportion of the jth requirement, which can be sup- 
plied by one dollar’s worth of the ith feed. These elements were com- 
puted using average wholesale prices in Kansas City from October 
1949 through September 1950, and using commonly accepted require- 
ments. 

The second section exhibits an estimate of the maximum absolute 
error, 7:;, associated with each element of A. These estimates are not 
exact, and warrant further study, but will serve the purposes of this 
paper. Obviously, an error of at least 0.0005 is possible for each ele- 
ment, since the data were rounded to three decimals. Somewhat larger 
errors are possible because of variation in the nutritive content of the 
feeds. We assume no error in prices. The problem might be stated thus: 
If these feeds could be bought at quoted prices in the 1949-50 feeding 
year, and if we assume feeds of approximately the average quality, how 
much of each of the four feeds would be required? The assumed »;; ap- 
pear reasonable from this standpoint. We also consider the case in 
which 7;;=74=0.0005. This would be an appropriate assumption if it 
were assumed that feeds of exactly average quality could be bought at 
the precise quoted price. 

Finally, the last section of Table I shows the inverse of A, which we 
designate C. It was calculated by a method which first obtains the 
adjoint of A [27] [5, p. 217]. In this way, we can guarantee that the 
computed elements of C are correct to the three decimal places re- 
corded. This value of the inverse, the elements of which are correct to 
three decimal places, can also be computed by the method of Section 13. 
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To the right of Table I are shown the sums of absolute values of 
elements of rows. Below are shown the sums of absolute values of 
elements of columns. These sums are used in our analysis. 

We can guaraniee that C is computed to three decimals of accuracy, 
But we have not considered the possible effects of the inherent errors, 
nij- The main purpose of this paper is to determine bounds, or extreme 
values, to the discrepancies between elements of C,, and elements of the 
inverse of a “true” matrix which is free from inherent errors. 


III. GENERAL FORMULAS FOR THE INHERENT ERRORS OF THE INVERSE 


We first. consider the accumulation of the inherent errors. Let 7 be a 
real matrix which is composed of an approximate part A and an error 
part HZ. Then 


(3.1) T=A+E. 


The matrix E is usually composed of unknown elements, e;;, which may 
be positive, negative, or zero, but which are not greater in absolute 
value than some bound »;;. We desire to find the inverse of T knowing 
the value of A and the bounds for £, or alternatively, we desire to 
know how much the inverse of 7’ might differ from the inverse of A. 
It is necessary of course that both A and T be non-singular. 

In most applied problems the bounds for E are small when compared 
with the corresponding elements of A, i.e., the relative errors of the 
elements of A are small. A major concern of this paper is with this 
situation though some of the formulas are appropriate to the more 
general case. 

In the first part of the paper we assume that A-! can be obtained by 
exact methods, or, at least, that a correct k decimal place approxima- 
tion can be computed. We get immediately from (3.1) writing C= A 


(3.2) TO =(A+ £2)" = [7+ EC)A} = CU 4+ EC). 


If the elements of E are small enough so that any norm! of EC is less 
than 1, it can be shown [25] that T-' can be expressed in terms of the 
convergent series [7, p. 40], 


(3.3) T-! = C[I + EC + (EC)? — (EC)? +--- ]. 
Pre-multiplication by A shows that 
(3.3a) I — EC + (EC)? — (EC)? +--- = AT = (I+ EC)". 





1 See Section IV for a discussion of norms. 
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The accumulation of the inherent errors in the inverse is given by 
the difference between T-! and A-'=C. In this paper we call this the 
discrepancy matrix, D. Then 
(3.4) D = T-!— C = — CEC[I — EC + (EC)? — --: | 
(3.4a) D = T-'—C = — C[EC — (EC)? + (EC)? — --> ]. 





Equations (3.4) and (3.4a) are basic to all the sections of this paper 
dealing with inherent errors. 
From (3.1) with C=A- one has 


D = T-' — A = T-(A — T)A-“! = — TEC 


5 
” = —(D+ C)EC 
so, solving for D 
(3.6) D = — CEC(I + EC)". 


Alternative forms of the above formulas are 


(3.2)’ T-' = (4+) = [A(I + CE)} = (1+ CE)" 
(3.3)’ T-! = [I — CE + (CE)? — (CE)*--- JC 

(3.4)) D=T"-C=-— [I -CE+ (CE)?-—--- ]CEC 
(3.5) D=T-!— A-! = — CET— = — CE(C + D) 
(3.6) D=T--—C = — (I+ CE)"CEC. 


Neglecting terms of (3.4) of order higher than one in E we get: 
(3.7) D, = — CEC, 


where D, is the matrix of first-order discrepancies. 
This formula could be written somewhat symbolically as 


(3.8) d(A-!) = — A-"(dA)A- 


which is a matrix generalization of the formula for d(a—). Other deriva- 
tions of this formula (or of the corresponding derivative formula) are 
available [5, p. 285], [14], [6]. 


IV. THE NORM OF THE DISCREPANCY MATRIX 


We wish to establish an upper bound to the elements of D. One 
method is to use some norm of D. For our purposes we define N(A), the 
norm of the real matrix A, as a quantity which satisfies the conditions 
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(a) N(A) 20 

(b) N(A) = Oif and only if A = 0 

(ce) N(cA) = lc| N(A) where c is a scalar 

(d) N(A + B) S N(A) + N(B) 

(e) N(AB) S N(A)N(B) 

(f) N(A) = 1if A is a fundamental unit matrix (with all 


elements zero except one element which is unity). 


(4.1) 


These conditions are in substantial agreement with those advocated 
by Bowker [2]. Application of these conditions to (3.4a) gives with 
N(EC) <1 








(4.2) (Dp) = NONE) 
1 — N(EC) 
and if N(E)N(C) <1, 
we yo) ¢ XOWOr 
= Teo 


Several norms are available. In this section we feature norms ad- 
vocated recently by Hotelling [11], Bowker [2], and the maximum 
coefficient, related to a norm, advocated by Turing [24]. These are 


(4.4) N,(A) = ( > aut) = (trace A’A)!/?, 
ii 

(4.5) NA) = max >>| axl, 

(4.6) M(A) = max | a;;| . 


7 

It is to be noted that nM(A) isa norm. The M(A) used by Lonseth [13] 
appears to be a norm as does the max; [A;(A’A)]', where A; is the 
characteristic root of A’A, of Wittmeyer [29]. However the M(A) and 
m(A) of Lonseth and the max; [A;(A’A)]"? and min; [\;(A’A)]'” of 
Wittmeyer play the more precise role of bounds. 

We calculate norms for the discrepancy matrix of Table I. We com- 
pute N(Z£) from the 7;; matrix and N(C) from the elements of the in- 
verse matrix. We have 


(4.7) N,(Z) S 0.003873, Ni(C) = 25.60 so Ni(D) S 2.82. 
(4.8) N.(#) S$ 0.0045, N.(C) = 29.376 so N»(D) S 4.47. 
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If the Bowker norm is applied columnwise rather than row-wise, we 
get 
(4.9) N»s'(#) S 0.0044, N.’(C) = 28.954, so N,’(D) S 3.79. 
It can also be shown that when n?M(E#)M(c) <1 ' 
n?M (E)(M(C) |? 
1 — n?M(E)M(C) 





(4.10) M(D) s 


Then 

(4.11) M(E£) S$ 0.0015, M(C) = 14.896, so M(D) S 8.29. 
The use of nM(A) as a norm gives 

(4.12) Na(#) S 0.0060, Nu(C) = 59.584, so Nu(D) S 33.154 


and no element of D is larger than 33.154/4 =8.29 as is shown in (4.11). 

From the calculations above we could use any of 2.82, 4.47, 3.79, or 
8.29 as an upper bound to the discrepancy elements. We are free to 
choose the lowest available bound. Naturally we prefer a bound of 2.82 
to one of 8.29 but even the bound 2.82 is too high for much confidence 
in the precision of the elements of the inverse matrix of Table I. It is 
not surprising though that the norm which requires the most detailed 
calculation is the one which gives us the best results, 

These norms are all relatively easy to compute and in some cases, 
particularly when all the terms of the matrix have approximately equal 
absolute values, may be satisfactory. In general, however, they provide 
rather loose bounds to the discrepancies, d;;. Commonly, as in the 
present case, we need a closer bound for each d;;. We proceed therefore 
to consider the possibility of establishing closer bounds, or perhaps 
lower extreme values, than those found in this section of the paper. 


V. BOUNDS FOR THE ELEMENTS OF THE DISCREPANCY MATRIX 
WHEN THE ERRORS HAVE A COMMON BOUND 


An important special case is that in which the maximum absolute 
error in any element of A is 7. For example, if the elements of A are 
accurate to three decimal places, the value of the error of any element 
is between —0.0005 and 0.0005, and we say 7 =0.0005. . 

A method is available [5, p. 285] for computing the first order ap- 
proximations to the elements of the discrepancy matrix. This is done 
by assigning the elements of F in (3.9) as either —7 or 7 in such a man- 
ner as to make a particular element of D, take on the extreme value. 
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A different set of values of EZ may be necessary to obtain the extreme 
value for each term in the first order discrepancy matrix. The results 
are given by the formula 


(5.1) B[d.1e | = du Cr; 2d! Cis| " 


The calculations are illustrated in Table II where the A and C 
matrices are those of Table I and 7=0.0005. The sums of the abso- 
lute values of the rows and columns of c;; are placed in the margins 
of the first matrix of Table II. These marginal values are multiplied to 
obtain the elements of the matrix. These products are multiplied by 
n ==0.0005 to obtain the values B[d,.,.] which are recorded in the second 
matrix. 











TABLE II 
Products Row Sum 
Products 401.765 638 .892 243 .871 827 .303 28.573 


174.483 277 .465 105.911 359.290 12.409 
49.945 79 .423 30.316 102.845 3.552 
413.056 656 . 847 250 .724 850.553 29.376 


Column sum 14.061 22.360 8.535 28.954 73.910 
B[di: rs] 0.201 0.319 0.122 0.414 

0.087 0.139 0.053 0.180 

0.025 0.040 0.015 0.051 

0.207 0.328 0.125 0.425 
B [d,s] 0.209 0.331 0.127 0.430 

0.090 0.144 0.055 0.187 

0.026 0.042 0.016 0.053 

0.215 0.341 0.130 0.441 





How good are these first order discrepancies? One of the purposes of 
this paper is to show that the results of the method described above 
can be extended, with little more computational work, to provide 
bounds for the actual discrepancy matrix rather than for bounds for the 
first order approximation to it. This can be done if N(EC) <1 and if 
n<1/(D2ds| css] ). 


We show below that a bound for d,, is given by the formula 


nD. C,;| >>} Cis| 


(5.2) B{d,.] = — 


19d. DI ci 
t 7 
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This formula was given essentially, though not explicitly, by Lonseth 
[13, p. 197] who directed his attention primarily to bounds for the errors 
of solutions of equations rather than to bounds for the errors of the 
inverse matrix. The general nature of Lonseth’s proofs should be of 
interest to the more mathematically inclined readers. 

This formula indicates that we need only multiply the first order 
discrepancies by 1/(1—n >: >_,| exj| ) to get discrepancy bounds. Alter- 
natively these bounds may be obtained by multiplying the products in 
Table II by n/(1—n )-s >) ci;| ). Thus the values in the last matrix of 
Table II are obtained by dividing the values of B[d1.r.] by 0.963045. 
Slightly more accurate values, since they avoid the rounding off errors 
of the B[d;.,.] terms, may be obtained by multiplying the elements of 
the first matrix of Table II by 0.000519187. 

In this case the first order bounds underestimate the bounds by 
about 3.84 per cent of the first order bounds. More generaliy the ade- 
quacy of the first order bounds may be measured by the formula 


Bid] _ a 1 r 
B[d:: +. | ~~ p> 2D! c:;| 


¢ @ 
The first order bounds are very unsatisfactory if n >>; | c:3| is nearly 
one and they are of doubtful value unless 7 >>; >>; c;;| <0.1. Equation 
(5.3) implies that » should be less than 1/(10 >>; >.;]¢;;|) if the first 
order bounds are to underestimate the bounds by 11% or less of the 
first order bounds. 
We now prove (5.2). If 7 is the maximum absolute value of any ele- 
ment of Z, then the maximum absolute value of each element of EC is 
given by 


(5.3) 1. 





n>,| ci| >| Cia| +4 n>.,| Cin| ; 
n>| ca] 21>] ce] --+ 1 do] cen 


(5.4)  B[EC] = 





n >| ci| n>| Ci2| Ste n>.| Cin| i 


We note that the rows of B[EC] are identical. This is true also for the 
rows of B[EC]?, B[EC}*, etc. Now B[EC}? may be found by multiplying 
each element of B[EC] by n >: >) c:5| ; B[EC}* may be found by mul- 
iiplying each element of B[EC] by [n >>; > ;lc:;|?, ete. Finally if 
n >.+ >_3| ¢sj| <1 and if we put 


(5.5) k=1—1d DI es 
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we have 


ea Dl eal 2Zleal 2D eal 7 


k k k 
| 


n >| ca| n>| cia — n> 








Cin| 


(5.6) B| x (-— y(Ecy | < k 


n>| ci| n >| Cis| = nd Cin| 
a k k k dj 











and by taking bounds in (3.4a) we get (5.2). 

Table II provides bounds to d;; on the assumption that the maximum 
error in each element of A is 7 =0.0005. Section IV of this paper provides 
norms for the case in which 7,; varies from 0.0005 to 0.0015. We could 
use the methods of this section, with 7 =0.0015 and find the bound for 
the term having the largest possible error. This is at the junction of the 
row and column of C having the largest absolute sums. Thus from 
Table I, we have at once 


(0.0015) (28.954)(29.376) _ 
1 — (0.0015)(73.910) 


Bldu]| = 1.435 





which, though obviously larger than the value of 0.441 in Table II with 
n =0.0005, is appreciably smaller than the bounds established in Section 
IV with the use of norms. The methods of Section V are almost as easy 
computationally as those of Section IV and often, as in this case, give 
much more satisfactory results. 

The bounds above are quite general, applying to any inverse matrix 
and to any assumed error bounds, 7;;. For example, the results to this 
point show that the absolute error in cy of Table I cannot be greater 
than 0.441 if we assume that each 7;;=0.0005, nor can it be greater 
than 1.435 if the maximum 7;; are as stated in the second section of 
Table I. Bounds of this kind can be computed quickly and easily, and 
often should be close enough for our purposes. 

But, in general, it is possible to reduce the bounds in two ways: First, 
by taking account of the signs of error terms; and, second, by using the 
complete error matrix, 7;;, when its elements are unequal. Later sections 
of this paper determine the lowest possible bounds for the general! case; 
—that is, they show the extreme attainable values of d,,, taking ac- 
count of signs of error terms, and making use of the entire »;; matrix. 
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Before doing this, however, we shall briefly consider a special class of 
matrices in which there is no problem of signs. 


VI. A SPECIAL CASE IN WHICH B[d,,] Is AN ATTAINABLE EXTREME 


There are certain special cases in which the bounda ef Section V are 
actually attainable, and thus are the best possible bounds when 7;;= 7. 
One of these is the case in which all elements of C have the same sign, 
i.e., the signs are all positive or all negative. 

The case in which the signs of C are all positive is of current interest, 
since it includes the so-called “Leontief Matrices” used in “inter-in- 
dustry analysis” of “input-output studies.” Recent proofs are available 
(25, p. 150] and [9] that the inverse of the Leontief matrix is composed 
entirely of non-negative elements. 

The positive nature of the elements of the inverses of matrices having 
the mathematical properties of “Leontief” matrices was established in 
1907 by Frobenius [8] and in 1920 by Janet [12, pp. 135-41]. Ostrowski 
[18] [19] developed the properties of these matrices in considerable de- 
tail. Though Ostrowski directed his efforts toward the determination of 
the bounds for the determinants, he also gave excellent surveys of pre- 
vious work including the allied topic of bounds for the errors of the 
solutions of linear equations. 

More recently Taussky [23] has written on these determinants"and 
Woodbury [31] has written on the properties of “Leontief” matrices. 
They provide extensive bibliographies. The stability of the inverse 
Leontief matrix has been studied by Woodbury [31]. 

We first illustrate the fact that if the inherent errors associated with 
the elements of a matrix have a common bound, 7, the methods of 
Section V provide the lewest possible bound to the discrepancy terms 
of the inverse. In other words, in the Leontief case B[d,,] is an attain- 
able extreme. 

In actual practice, inter-industry analyses are usually made by the 
use of Leontief matrices of large order. The Bureau of Labor Statistics 
has constructed such a matrix of order 450, and plans to invert one of 
order 200. However, the principle can be shown by considering the 
simple 3-rowed Leontief matrix in Table III. 

We note that the computed inverse, C,, is composed entirely of posi- 
tive elements. The discrepancy bounds B[d,,] are computed by (5.2) 
using the row-sums and column-sums of C, and assuming 7 =0.005. 
These discrepancies could be actually reached in the extreme case in 
which the error of each element of A is negative. This is shown in the 
final two sections of the table. First we adjust A by subtracting 0.005 
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from each element, then invert the adjusted matrix. The differences 
between the inverse of the adjusted matrix and the inverse of the origi- 
nal matrix are precisely B[d,,] as computed, aside from an error of one 
digit in the first element of the second row and the second element of 
the third row. These errors are due to rounding of both inverses to two 
decimals. 

The principle illustrated with this simple numerical matrix holds for 
any Leontief matrix, no matter how large, as is shown in Section VIII. 
The B{d,.] bound provides extreme values of the discrepancies which 
would be reached if all inherent errors were at their maxima, and all 
were negative. Therefore, the bound cannot be improved in case of a 
Leontief matrix with a common bound to the inherent errors. 


TABLE III 








—0.50 .30 
.00 .30 
.30 1.00 


Leontief matrix, 
A= 


oo ~ 
sss 


Absolute row-sums 
17) 3.34 
42 3.12 
37 2.60 


.86 D> dl aii| =9.06 
.05 | 
.05 
04] 


.305 
.305 
.995 


.82 
77 
41 


ViI. A LEONTIEF MATRIX WITH UNIFORM PERCENTAGE ERRORS 


01 
.61 
.68 


.30 
.06 


Computed inverse, 


onto 
aod 


Q 
ll 
ee 


oo 
— © oO 


Absolute column-sums 
Discrepancy bounds, as- 
suming 7 =0.005 .05 
B [drs ] _ .04 


Adjusted matrix 0.995 .505 





Faas 


—— 
oon 


B = [a;;—0.005] =| —0.405 .995 
—0.205 .305 


| 
“oo eoce eooew 


0.83 66 


Computed inverse of B 1.61 .07 
0.59 .73 





In some practical cases, the bound of an inherent error 7;; may be 
some proportion of a;;. For example, we may set the bound 


(7.1) Nij s ka; 0<k<1. 


Consider a Leontief matrix, with inherent errors bounded by (7.1). 
In this case we can determine a very simple bound to the discrepancies 
in the inverse. Since each element of the inverse is positive, the extreme 
discrepancies will occur when all errors are negative. In this case, the 
“true” matrix would be 
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au(l1— k) an(l — k) -++ an(1 — k) 
72) Qu(1 — k) ao(l — k) +++ aen(1 — k) 
Qu(l—k) aw(l—k)--- 
and more briefly, 
(7.3) T = A(I — kl). 
The inverse of the true matrix would be 


I I 
7.4 ? ie = —ene Al = —— C, 
(7.4) 1-—k 1-k 


which would be obtained by multiplying each element of C by 1/(1—4). 
Therefore, the maximum discrepancy in this case is 


(7.5) B[d,+| = C ; 
~~ ~~ Oh ed 


Cr. 


Thus, if the maximum inherent error associated with any element of 
a Leontief matrix is 5 per cent of that element, the maximum dis- 
crepancy associated with any element of the inverse is 0.05/0.95 times 
that element of the inverse. This is the closest bound which could be 
established, since the discrepancies are extreme values which could be 
reached if the errors were distributed in the worst possible way. The 
bound stated in (7.5) holds for a Leontief matrix of any order and is 
extremely simple to apply. 


VIII. LEMMAS CONCERNING THE EXTREME DISCREPANCIES 


The discussion of the earlier sections is concerned with bounds for the 
first order discrepancy matrix and the discrepancy matrix. Aside from 
the methods of Section VI and Section VII, which are applicable to 
special matrices, there is no guarantee that the bound can actually be 
attained. In this and the sections of the paper immediately following we 
work out somewhat more complicated techniques for finding the best 
possible bounds, i.e., smaller bounds than those described above and 
bounds which cannot be replaced by yet smaller bounds because these 
can actually be attained in some extreme problem. These discrepancy 
bounds we call “extreme discrepancies.” The theory also enables us to 
produce the matrix, A+, which would maximize the discrepancy be- 
tween a particular element of A-! and the corresponding element of 
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(A+£)-—. The derivation is somewhat more involved than is the appli- 
cation, which is very satisfactory for the case of common bounds. 

The solution is based on two lemmas, proved below, which establish 
the fact that a specific sign can be assigned to each element of the error 
matrix, once the c;; are known, so as to obtain the extreme discrepancy 
for any given element of the inverse matrix. The first lemma deals with 
extreme first order discrepancies and the second with extreme total dis- 
crepancies. With the signs determined, it is possible to find the dis- 
crepancy matrix which results from the specified errors in several ways. 

We know from (3.9) that the first order approximation to the dis- 
crepancy matrix is given by —CEC. Formal expansion of this product 
shows that the value of the element in the rth row and sth column of 
the first order discrepancy matrix is —C,EC, where C, is the vector of 
elements in the rth row and C;, is the vector of elements in the sth col- 
umn. The first order discrepancy for the element in the rth row and sth 
column of the inverse can then be written 


(8.1) di:n = — >» y ¥ CrilijCjs. 
. ¢ 


We are free to give the signs to the e,;; so as to obtain the extreme 
di:rs. If we take the sign of e;; as opposite to that of the product c,;cjs, 
then d:.,, is as large (positively) as is possible. Similarly if the sign of ¢;; 
is taken as that of c,:c;s, dir, takes on its extreme negative value. We 
introduce the definitions 


6:;;=1 whenc,; has the same sign as Cj, 
(8.2) 6;; = — 1 when c,; has the opposite sign to that of cj. 
6;;=0 whence,; or c,;=0, 
€i; = 0:3¢;; Where ¢;; = 0. 
Then (8.1) becomes 


(8.3) L [di] = — de p CriD j€: jC jn 


where L[d1.,.] indicates the smallest possible (attainable) value of d1:,s. 
Similarly if 6/;= — 633, we have 
(8.4) L[dizre] = Do Do CriBss’esitje = DY Do CriBiiCje 

i i 


is the largest positive value of d1.rs. 
The extreme first order discrepancy of a given element is not neces- 
sarily associated with the extreme first order discrepancy of another 
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element. Theoretically it is necessary to fix the signs of each e;; in order 
to obtain the extreme discrepancy for some one element. Sometimes 
these signs will also may.imize the discrepancies associated with several 
other elements, or even of all elements. Thus if the values of c;; are all 
positive, all 6;;=1, 6;;/= —1 no matter what the values of r and s. If the 
e;; take on values as large as ;; in absolute value, (8.3) and (8.4) be- 
come 


L{d,: ool sesliiiaes a ie CriDijnisCie 
. 9 


(8.5) L[d:: oll = p p CriDijNijCiey 


while if 74;=7, we get 
Ldn) = — 9D ctu 
. 


8.6) L[d: ool — ie ) Cri sjCie. 
s 7 
We now state Lemma 1. The extreme first order discrepancies of cys are 
given by (8.5) and (8.6). 
Once the signs of the e;; are determined, the values of the other ele- 
ments of the first order discrepancy matrix, though not necessarily ex- 
treme discrepancies, may be computed with the formulas 


di; pq + pv ,¥ CpiDsinijCig 
¢ fj 


dh: pg = F 2D »» CyifijCig. 
* ¥g 


(8.7) 


We next consider the questio” of finding extreme discrepancies rather 
than extreme first order discrepancies. Using (3.5) and (3.5)’ we get 


(8.8) D=T?-—C=T IEC = — CET. 


If we knew the signs of the elements of T-' we could determine the 
signs of E so as to obtain an extreme value of d,,., just as we did of 
dir, above. We do not know the value of T-', but if the relative errors 
are small, the elements of T— are close to those of C. If the relative errors 
are small enough so that the signs of the elements of T— are identical with 
the signs of the elements of C, then the extreme discrepancies occur simul- 
taneously with the extreme first order discrepancies. This is Lemma 2. 
One does not know, of course, at the beginning that this condition of 
identical signs is satisfied. But he computes the greatest possible dis- 
crepancy for each element, with the given bounds for e,;;, and knows 
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that he has the extreme discrepancy for each element if the discrepancy 
is less in absolute value than the corresponding element of C. 

The calculation of this potential extreme discrepancy is accomplished 
in various ways, as is shown in the sections following. We wish to indi- 
cate here that the value of this extreme discrepancy element 


(8.9) L{d,.] = — L[d:..]-[I — EC + (EC)? —--- ] 


is obtained by using the same £ in the series which is used in obtaining 
L{d1.r.] from CEC. This formula can also be written as 


(8.10) D = — CEC + CECEC — CECECEC + - -- 


where £ is taken so as to obtain extreme discrepancies for d,.. 

We now return to the problem discussed in Section VI. We know that 
the elements of the inverse matrix are all non-negative so the signs of 
all e;; terms are taken as non-negative. The extreme negative first order 
discrepancies are obtained by setting e;;=7;;. When 7:;=7, the absolute 
values of the first order discrepancies become 7 >.:,;¢ri9ijCj. Since the 
elements of 6 and those of the inverse matrix are non-negative, this 
becomes 7 >. i¢,i > ,;Cje and this is the equivalent of (5.1). Hence (5.1) 
gives the absolute value of the extreme first order discrepancies. 

If all the extreme first order discrepancies are smaller than the corre- 
sponding values of c, the formula (5.2) gives the extreme discrepancies, 
as is illustrated in Table III and as clarified in sections following which 
treat also the case of unequal bounds. 

The Leontief matrix is not the only special case in which all maximum 
discrepancies are attained at once. Any matrix having an inverse whose 
elements alternate in sign also has this property. More generally, if the 
signs of each row (column) of the inverse matrix are equal to (or the 
negative of) the signs of every other row (column), the matrix takes on 
extreme values, positive or negative, simultaneously. Thus the elements 
of the matrix having an inverse with signs 


+ - = + 
+ ~~ - + 
take on extreme values simultaneously. 
Some elements may take on extreme values simultaneously even 


though not all the elements do, if the signs of the corresponding rows 
(or columns) of the inverse matrix are identical or opposite. Thus the 
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extreme negative discrepancy of ay in Table I occurs simultaneously 
with the extreme positive discrepancy of au. 


IX. THE COMPUTATION OF THE INVERSE OF AN ADJUSTED MATRIX 


Once we know just what are the values of E which give extreme dis- 
crepancies, we can avoid the expansion of a series suggested in the last 
section by using techniques which have been worked out to handle 
adjusted matrices [5, p. 292]. An adjusted matrix is one in which some 
element (or elements) is replaced by some other element (or elements). 
Formulas featuring determinants were worked out by Sherman and 
Morrison [21] [22] and Woodbury [15] for the case of adjustment of a 
single element and for the case of the adjustment of the elements of a 
column or row. These adjustments have been described in matrix form 
[5, p. 293]. A more general type of adjustment has been worked out by 
Bartlett [1] in which all elements of the matrix may be adjusted if the 
matrix of adjustments can be expressed as the product of two vectors. 
Still more general formulas for the adjustment of matrices have been 
given by Woodbury [30]. The attention of the reader is also called to 
the earlier (1936) work of Wittmeyer [29] who used max; [A;(A’A)]!/? 
and min; [A;(A’A)]'/? in obtaining bounds for the errors in adjusted sys- 
tems of equations, and to the recent work of engineers [28] [16] [3]. 

The two general formulas given by Woodbury, which are slightly 
altered here to parallel the formulas of Section 3, are 


(9.1) D= (A+ USV)- — A“ = — CUS(S + SVCUS)“SVC 
or if S is non-singular, 

(9.2) D = (A — US"'V)-! — A-! = CU(S — VCU)"VC. 

These are too general to be useful to us here in their general form, 
though special cases are very useful. For the general problem it might 
be more practical to calculate (A+£Z)—, when E is known, by the 
method of the next section rather than to follow the computation of 
(S—VCU)-, or of (S+SVCUS)—, by the additional matrix oper- 
ations demanded by (9.1) or (9.2). 


These formulas are very useful for the special case in which S is a 
scalar, say o. Then wu is a row vector, v a column vector and 


(9.3) D=(A + uov) — A = — Cuo(o + ovCuc)ovC 
becomes 
(9.4) D = (A + wov)! — A= — 


oCwCl 
1+ ovCu 





306 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 10953 


Proof of (9.4) follows by pre-multiplication or post-multiplication by 
A+EH=A+uov. Following the style of Bartlett [1], one may arrive at 
this formula by means of formal substitution in (3.4). 

This formula for the inverse of an adjusted matrix is very useful in 
obtaining the extreme discrepancy matrix when the errors of the ele- 
ments of the original matrix have common bounds, as is shown in Sec- 
tion XI. 


X. THE RECALCULATION OF THE INVERSE 


The equation (3.2) rather than (3.4) might be used once the values 
of the error matrix which give extreme discrepancies are known. This 
method calls for the calculation of T-'=(A+£Z)" and thence 
D=T-'—C. It is extremely simple in theory and is especially useful 
where machines are available for inverting matrices efficiently. Ideally 
the reduction method should be some exact method such as the method 
of determinants [5, p. 76] though good approximation methods would 
be satisfactory, particularly if corrected to a given number of decimal 
places by the method of Section XIII. It may be necessary to make a 
recalculation for each element of the inverse for which it is desired to 
obtain the extreme discrepancy. 


XI. EXTREME DISCREPANCIES FOR THE CASE OF COMMON BOUNDS 


We are now in a position to improve the treatment of the case dis- 
cussed in Section 5 in that we can produce extreme discrepancies. We 
have three different means of calculation at our disposal. These are 

(a) the method of expansion in series, 

(b) the method of adjusted matrices, 

(c) the method of recalculation. 

We also discuss (d) the treatment of joint extreme elements. We discuss 
them in order. Application is made to the problem of Tables I and II 
with » =0.0005. 

(a) The method of expansion in series. In this case we use the matrix 
expansion (8.10) and compute the matrices CEC, CECEC, CECECEC, 

- ++, until convergence is attained to the desired number of places. If 
nz=n and E=nf@’, the value of D is 


(11.1) D = n[Co’C] + °[Ce’Ce’C] + »°[Ce’'Ce’Ce’C]---. 


The illustration of this method is presented in Table IV, where the 
extreme discrepancy for Cy and the other associated discrepancies are 
computed. The value of the matrix 6’ is obtained by premultiplying the 
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column vector of signs of the fourth column of C by the row vector of 
signs of the fourth row of C. This result is placed at the top of Table IV. 
The three decimal place value of C is placed below 6’ below which ap- 
pears 6’C. The values C@’C, Cé’C0’C, Cé’C0’C'6’C are computed and re- 
corded below. The corresponding elements of these matrices are then 
multiplied respectively by n, 7’, 7? and the results are added to obtain 
the discrepancy matrix. The elements of this discrepancy matrix are 
less in absolute value than the elements of the inverse matrix above so 
we may say that extreme discrepancy of the dy term is 0.438. This is 
slightly less than the 0.441 obtained in Table II. If » were larger, the 
improvement would be greater. 

For this problem the dy term is somewhat smaller in absolute value 
when the values of 6;; are used in place of 6;;’. Thus if H = —7@’ instead 
of 6’, (11.1) becomes 


(11.2) = — 1C0'C + (—n)?C0’C0’C + (—n)?C0’C0’C8'C + --- 
and the values are those of sums of alternating series. Thus in Table IV 


du = — 0.0005(850.553) + (0.0005)?(50504) 
— (0.0005)*(300.10) + --- = — 0.413. 
(b) The method of adjusted matrices. 


The value of EF must be taken, in order to obtain the extreme r, s term 
in the discrepancy matrix as 


E = [es;] = unv 


where u is a column vector having elements with absolute value unity 
with signs of c,, v is a row vector with unit elements, aside from sign, 
and with signs identical with those of c,. The sign of the scalar 7 is not 
yet determined. In order to attain the extreme value, positive or nega- 
tive, the sign of 7 should be taken opposite to that of vCu so that the 
denominator of (9.4) becomes as small as possible. If vC'u is positive, 
should have a negative sign and the formula for the discrepancy matrix 
is 


CuvC a Ce’C 
? 1 — nvCu thes 1 — nvCu 





(11.3) D=-— 


Now since the numerator of (11.3) is the value of D, as found in Section 
VIII, we need only calculate 1—nvCu and divide it into D, to find the 
values of the discrepancy matrix associated with some extreme dis- 
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TABLE IV 
METHOD OF EXPANSION IN SERIES 








—] 
—! 
—1 
1 


—13. 
—0. 
—0. 

14, 


—28. 
—28. 
— 28. 

28. 


— 827. 
—32. 
—8. 
850. 


Ce’Ce’C —49,124 
—1,944 

—512 

50,504 


Ce’Ce’Ce’C - 10-4 — 292 
—12 

—3 

300 


.066 .281 101 —0.426 
.003 O11 .004 —0.017 
.001 .003 .001 —0.004 
—0.068 .289 .104 0.438 





crepancy element. The method is illustrated in Table V where, again, 
it is desired to obtain the extreme value of the du term. The value of C 
is recorded and the column sums are recorded after each element has 
been multiplied by the sign of the corresponding element in the fourth 
column. Corresponding row sums are obtained after the elements have 
been multiplied by the signs in the fourth row. These marginal entries 
are summed after multiplication by the signs of the fourth row (or 
fourth column) to get vCu = 59.378. Since vCu>0, n is given a negative 
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sign. The value of Cé’C is then obtained by multiplying the marginal 
entries, as in Table II. The results agree with the results of the more 
formal methods of Table IV. 

The values of D, are then obtained by multiplication by 0.0005 and 
the values of D by dividing the values of D, by 1— (0.0005) (59.378) 
=(0.970311. The resulting values are in complete agreement with the 
results of Table IV. They are obtained much more easily with the 
method of Table V. 

We should note further that the other set of extreme discrepancies 
(having signs opposite to those of this set) are obtained by using the 
multiplier 


—% — 0.0005 — 0.0005 


1+ Cu 1+ (0.0005)(59.378) 1.029689 





The value of the extreme negative dy term, for example, is 


(— 850.553) (0.0005) 
1.029689 





= — 0.413, 


as shown above. 


TABLE V 








5.153 3.311 —13.626 
6.485 —0.841 —0.285 
—1.627 1.084 —0.147 
—9.095 —3.299 14.896 
- 106 — 6.853 28.954 


.916 195.811 — 827 .303 
.609 7.751 —32.747 
.694 2.042 —8.628 
258 314 850 .553 


.273 .098 —0.414 
O11 .004 —0.016 
-003 .001 —0.004 
.281 101 0.425 


281 101 —0.426 
O11 .004 —0.017 
.003 -001 —0.004 
. 289 .104 0.438 
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(c) The method of recalculation. The method of recalculation, de- 
scribed in Section X, is illustrated in Table VI. The matrix of error 
bounds, with signs assigned so as to maximize the d4, term, is first 
exhibited. This matrix is then added to A to obtain the specific A+ £. 
The inverse is then computed. This computation is not exhibited in 
Table VI but the actual calculation used the method of determinants 
so that the answers can be guaranteed to four decimal places. The dis- 
crepancy matrix, formed by subtraction of the four place value of A-, 
is recorded to three decimal places. It is identical with the three-place 
discrepancy matrix of Table IV and Table V. 


TABLE VI 
METHOD OF RECALCULATION 








—0.0005 —0.0005 —0.0005 0.0005 
—0.0005 —0.0005 —0.0005 0.0005 
—0.0005 —0.0005 —0.0005 0.0005 

0.0005 0.0005 0.0005 —0.0005 


0.4945 .2745 0.0975 0.4595 
0.4355 -4485 0.2635 0.4105 
0.3945 .5565 1.3185 0.3855 
0.4235 -4365 0.4675 0.4665 


6.5488 5.4340 3.4122 —14.0519 

(A+£E)" — 4.7956 6.4966 —0.8369 —0.3015 
0.6944 —1.6243 1.0856 —0.1518 

—2.1539 —9.3842 —3.4025 15.3346 


6.4831 5.1527 3.3112 —13 6256 
— 4.7982 6.4853 —0.8408 —0.2846 
0.6937 —1.6270 1.0845 —0.1474 
— 2.0863 —9.0950 —3.2987 14.8963 


0.066 0.281 0.101 —0.426 
0.003 0.011 0.004 —0.017 
0.001 0.003 0.001 —0.004 
—0.068 —0.289 —0.104 0.438 





The extreme negative value for dy is obtained by this method by 
taking the signs of E opposite to those of Table VI. Another recalcula- 
tion is then necessary. 

(d) Extreme discrepancies associated with the elements. Frequently we 
desire the extreme discrepancy associated with each individual element, 
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rather than the extreme discrepancy of some particular element. This 
can be done, for the case of common bounds, by the method of adjusted 
matrices but the values of the extreme CéC and 1—vCu must be calcu- 
jated for each element unless we have the fortunate situation, charac- 
terized by Leontief matrices, where the extreme discrepancies of many 
elements are taken on simultaneously, as illustrated in Table III. 

The technique is illustrated in the general case in Table VII where 
n=0.0005. The columns of C are added after multiplication by the 


TABLE VII 
TREATMENT OF JOINT EXTREME ELEMENTS 








—13.626 28.573 8.985 18.267 —28.573 28.573 
—0.285 1.131 12.409 —11.839 —1.131 12.409 
—0.147 0.298 —3.258 3.552 —0.298 3.552 
14.896 —29.376  -—18.606 —11.186 29.376 29.376 


. —28.384 57.116 11.924 44.844 —57.116 

22.360 ° —28.660 58.782 43.258 14.062 — 58.782 
6.136 . —28.384 57.116 11.924 44.844 —57.116 
—19.106 ° 28.954 —59.378 -—36.742 —21.166 59.378 


22.360 8.535 28.954 73.910 


401.765 638.892 243.871 827.303 0.971442 0.970609 0.971442 -—0.970311 
174.483 277.465 105.911 359.290 0.994038 0.978371 0.994038 -0.981629 
49.945 79.423 30.316 102.845 0.977578 0.992969 0.977578 -—0.989417 
413.056 656.847 250.724 850.553 -—0.971442 —0.970609 —0.971442 0.970311 0.963045 


0.207 0. 0.126 —0.426 
0.088 0. 0.053 —0.183 
0.026 0. 0.016 —0.052 
-0.213 —0. —0.127 0.438 





signs of each specific column of C to get the matrix below C. Thus 
14.061 is the sum of the elements of the first column of C when multi- 
plied by the signs of the elements of this column, 3077 is obtained by 
adding the products of the elements in the first column of C with the 
signs of the elements in the second column of C, etc. The elements of 
the rows of this matrix are then multiplied by the corresponding ele- 
ments of the rows of C and added to get the respective values of vCu, 
which are placed in a matrix at the right of this second matrix. These 
values are checked by the matrix at the right of C which is obtained 
with the use of ow sums rather than column sums. These four matrices 
are bordered with the sums of the absolute values of the elements in 
the rows and columns of C, and with the sum of the absolute values of 
ci; which is presented in the lower right. This portion of the table corre- 
sponds to a part of Table II. 





312 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1953 


The values of the extreme elements of CéC for each value of e;; are 
then computed by multiplication of the diagonal term of the row sum 
matrix by the diagonal term of the column sum matrix, or what 
amounts to the same thing, the values of >>;|¢,:| >o;|¢;s| are com- 
puted and agree with those of Table II. The values of 1 +nvCu are then 
computed and are placed in a matrix at the right of the CéC matrix. A 
transposition of the (vC'u) matrix is effected so that the elements of the 
recorded matrix correspond to the elements of the C@éC matrix. 

The actual values of the extreme discrepancy matrix are then com- 
puted. A negative sign is assigned to certain elements since, in this 
case, vC'u is negative and the element of the discrepancy matrix must be 
negative in order to attain this extreme value. 

In this problem, since 7 is small, these results are not much better 
than those of Table II, and they involved considerable additional cal- 
culation. In situations such as this, the method of Table II is to be 
recommended. In other situations, particularly where 7 is large and 
when the values of vCu differ appreciably in absolute value from 
>: Dsl exs|, the method just described usually gives much better re- 
sults. 


XII. EXTREME DISCREPANCIES FOR THE CASE OF UNEQUAL BOUNDS 


The methods of the last section are useful in treating the case of 


common bounds but they are not all so useful in handling the general 
case of unequal bounds. In particular, the method of adjusted matrices 
indicated in the last section is not usually applicable since the E matrix 
cannot usually be written in the form uCv. Though the matrix of ex- 
treme errors 


0.0001 0.0002 0.0003 
E =| 0.0002 0.0004 0.0006 
0.0003 0.0006 0.0009 


can be written as 
1 
E =| 2 | [0.0001][1 2 3], 
3 


the matrix of errors of Table I cannot be so written. What can we do 
in this case? 
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One answer to this question is to replace each 7; by the largest 
nj=1 and then use the methods of the last section. We might take 
7=0.0015 rather than 0.0005 in Tables IV, V, VI and VII and thus 
obtain answers to the problem of Table I. For example, we get from 
Table V 


(0.0015) (850.555) 
1 — (0.0015)(59.378) 





These results give bounds, rather than extreme values, and we do 
not know how much these bounds might be improved. We are still 
faced with the problem of finding the extreme discrepancies. 

Perhaps the simplest method, at least in theory, to use is the method 
of recalculation. The signs of the error terms are determined as in the 
last section. Using the »;; of Table I we find that the inversion of 


0.4940 0.2745 0.0975 0.4600 ; 
0.4350 0.4480 0.2635 0.4110 
0.3940 0.5560 1.3175 0.3860 
| 0.4240 0.4370 0.4680 0.4680 _ 








results in 

(A+E)u=15.3156 so that subtraction of Cu=14.896 gives du 
=0.419. This is considerably smaller than the value, 1.401, obtained 
with the method of common bounds. 

This method of recalculation may be considered impracticable by 
some, and some alternative method may be preferred. However, it 
should be mentioned here that this method of recalculation may be- 
come preferable to other methods as improved techniques of calculation 
are made available and as machinery is developed for computing in- 
verse matrices quickly. 

Alternatively we may use the method of expansion in series. The 
computation is somewhat more complicated here, however, for the size 
of the bound of each element must be taken into consideration at each 
step. Since the values of the errors are usually of the same order, we 
find it convenient to use a measure of this order as a scalar multiplier. 
Thus the matrix of errors of Table I, with the signs attached so as to 
obtain the extreme positive value of du, can be written 
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2 2 1 —2 
2 232 3 +3 
L-2s -~3 <3 2 


[0.0005 | 








or 
(12.2) E = Hn. 


where 7 is the unit of measurement. 
The series then becomes 


(12.3) D = nCHC + 1°CHCHC + »°CHCHCHC :-- , 


and the expansion is carried out as in Table IV. 

The method of adjusted matrices, as described above, is not satis- 
factory in this case since in general the extreme error matrix cannot be 
written as the product of two vectors. 


XIII, MAXIMUM EFFECTS OF COMPUTATIONAL ERRORS 


Many methods of computation do not lead to exact values of A?=C, 
or even to values which can be guaranteed to a fixed number of decimal 
places, but rather to values which are admittedly approximations to C 
and which, it is hoped, may be used in place of C. If the approximate 
matrix is denoted by Co, what discrepancies between C5 and C are per- 
missible? 

In a paper mentioned above, [26], one of the authors used the method 
of submatrices in arriving at a five decimal place approximation to C. 
This approximation, rounded to three decimal places, is 


6.485 5.152 3.311 —13.625 7 28.573 
—4.798 6.485 —0.841 — 0.285 | 12.409 
0.694 —1.627 1.084 — 9.147 | 3.552 
—2.086 -—9.095 -—3.299 14.896 | 29.376 
14.063 22.359 8.535 28.953 73.910 


(The sums of the absolute values of the row and column elements are 
placed in the margins for later use.) Is Co close enough to C so that it 
can be used in answering questions of the type discussed in the earlier 
sections of this paper? Can a bound for the computational errors be 
found? Can we find a matrix of computational errors which, when 
added to Co, gives C to three decimal places? 


(13.1) C= 
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The question of the accumulation of rounding-off errors has received 
extensive attention in the literature of the last decade. A complete 
survey of this material is hardly appropriate to this article but a few of 
the techniques should be mentioned. Hotelling [11] proposed that an 
approximation Cy) might serve as the basis of an iterative method which 
continues until the result is correct to the desired number of places. 
Satterthwaite [20] has proposed a type of error control in which the 
norm is reduced to a very small amount by matrix multiplication; von 
Neumann and Goldstine [17] have studied the accumulation of compu- 
tational errors in what we call the method of single division [5, p. 99]; 
Turing [24] studied the accumulation of errors of several common re- 
duction methods using the maximum coefficient. Hotelling [10] pro- 
vided formulas using norms which give bounds to the difference be- 
tween C and Cy and between C and C,, where C,, is the improved ap- 
proximation to A- resulting from m iterations. Following the style of 
Hotelling, but not his precise notation, we write the matrix of compu- 
tational errors as 


(13.2) A=C-—C,) = C(I — ACy) = CG. 


Using norms we get 
(13.3) N(A) S N(C)N(G). 


Unfortunately we do not know N(C) but we can develop an upper 
bound for it if N(G) <1. Since 


(13.4) C=C[AG)'=O[1 -Gl'=C[/l+G+a?+--- ], 
we have 
(13.5) A=C-Q=C(44+4+G+---] 


sO 


N(Co) 
(13.6) N(C) = i-k 


N(Co)k 
13.7 N(A) = , 
(13.7) (4) =—— 
where k= N(Q). 
This formula is given by Hotelling [10, p. 281]. A similar formula using 
M(A) is given by Turing [24]. In our notation his formula is 
nM (Co) M(G) 


(13.8) M(A) = i —mM(G) 
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We apply (13.7) and (13.8) to the problem above. We compute 
G=I-—AC, and get 


—1163 436 339 —1087] 2046 
—1114 441 427 — 87 | 2069 
—1365 403 911 —447 | 3126 
—1163 418 528 —148_ 2257 


(13.9) 10°-G = 








with 


Ni(Co) = 25.598, Ni(Co) = 29.3876, M(Co) = 14.896 
(13.10) n( 0) o( 0) ( 0) ) 


N,(G) = 0.002860, N.(G) = 0.003126, M(G) = 0.001365 


so 
(13.11) N,(A) < 0.073, Ns(A) < 0.092, and M(A) < 0.082, 


and it follows that no element of Co differs from the corresponding ele- 
ment of C by more than 0.073. This is less than many of the extreme 
errors attributable to inaccuracies of the original data (see Table VII) 
when 7 =0.0005 so there may be little point in making closer approxi- 
mation to A! unless it is possible to state the original matrix more ac- 
curately. 

If the approximation, Co, were not so good, i.e., if N(A) were con- 
siderably larger, it might be wise to obtain a closer approximation to C. 
Hotelling suggests 


(13.12) C1 - C, [21 ies ACo| - Co + CoG 
as the next approximation. Then 
(13.13) A =C-Q,=C-—-C)—-—CoG=A— CG 


and the difference between the approximations is CoG. 
This process can be extended. If it is extended through m iterations 
we can arrive at Hotelling’s formula 


N(Co)k?” 


(13.14) N(An) = ey 


Hotelling points out that the 2” exponential of k means that the number 
of sure decimal places is doubled with each iteration. 

Sometimes we wish to provide the A which, when added to Co, gives 
the value of C to a specified number of decimal places. Though this 
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can be done by iterative methods, such as the method proposed by 
Hotelling, it can also be accomplished with the use of (13.5) if Co is 
close enough to C' so that the series converges. The method is effective 
when C> is a good approximation to C, as in this case one needs to 
compute but a few terms of the series to obtain the correct values. This 
situation usually exists when direct methods or build-up methods are 
used in obtaining Co. 

In many cases, a single term of the series will be sufficient. This is 
true in the illustration above since CoG’, CoG’, - - + are too small to 
make any contribution to the third decimal place. It follows that 


—0.002 0.001 0.000 -—0.001 
0.000 0.000 0.000 0.000 
0.000 0.000 0.000 0.000 
0.000 0.000 0.000 0.000 


(13.15) A = CoG = 


Addition of A to Co gives the value of C correct to three decimals. 

The formula for (13.5) can be used in another way. Following the 
type of argument of Section 5 for inherent errors, we let gi, ge, - * + , Yn 
be the maximum absolute elements in columns 1, 2, - - - , n of G. Then 


‘Gn 


Each row of B(G?) is g: >.9, 92 >.9, °°» ng. Each row of B(G?) is 


g( og)? go Dug)? 4, «+ +, ga Dag)? 
The matrix }>>_,B(G)?) is composed of the rows 


Jn 


——_-- —-— 


So 06° 1-9 


9g 


1 
inde i« 


9g 





so that 


(13.17) extreme A,, = >.| Co: se — Sn 
k 


1-dg 


Application of (13.17) using (13.1) and (13.9) yields the result 
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- 0.039 0.013 0.026 0.0137 
0.017 0.005 0.011 0.006 
0.005 0.002 0.003 0.002 
L 0.040 0.013 0.027 0.013 J 


(13.18) B(A) = 








which, while not as good as (13.15), is much better than the results of 
(13.11). 
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A MULTIPLE GROUP LEAST SQUARES’ PROBLEM AND 
THE SIGNIFICANCE OF THE ASSOCIATED 
ORTHOGONAL POLYNOMIALS 


BRADFORD F. KIMBALL 
New York State Department of Public Service 


A multiple group least squares’ problem is presented and 
solved which involves determination of certain coefficients 
separately for each of several samples with a final coefficient 
given a single optimum value for all samples grouped to- 
gether. 

The computation of orthogonal polynomials implicit in the 
normal equations is discussed relative to the forward solution 
of these equations. Certain significant advantages of comput- 
ing orthogonal predictors are pointed out. 

A numerical example of the computation of the orthogonal 
polynomials in the case of a weighted fit is presented with ap- 
plication to determination of the confidence band of a fitted 
trend curve. 


I. A MULTIPLE GROUP LEAST SQUARES’ PROBLEM 


UPPOSE a cost-distance relationship is sought in a study of motor 
. carrier costs. Cost data from samples of several companies are 
available but they are from different regions and apply to different 
weight ranges of shipments. A quadratic polynomial distance function 
is to be fitted by least squares, where the distance variable z is taken 
as the square root of the distance, and the dependent cost variable is 
cost per 100 lbs. 

There is reason to take the coefficient of the squared term as con- 
stant over all samples, while the linear coefficients are to be fitted to 
the separate samples. How does one solve the problem? 

The answer is furnished by the following analysis: In the interests 
of simplicity limit the problem to three samples. One then seeks three 
cost-distance functions of the type 


aD + Be + yz" 
(1.1) u® = q@® 4+ By + yz? 


yu 


which give a best fit to the data in the least squares’ sense. 
The costs occur as three series y;“, s=1, 2, 3 with N“™ values in the 
sth series. The moments are designated by 


(1.2) ax) = S[zijzn; |, aio” = S[ziiy; | 
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where 
2; = 1, 20; = 2;", 23; = (a;)?. 


Thus the matrix of the normal equations for each sample considered 
separately will be 


(1.3) lla a2 ais, aio||, i= 1, 2,3. 


For each sample we first neglect the squared term in z and obtain 
solutions 


(1.4) U®) = B® + Bz 

of the normal system 

(1.5) laa ai, aso], i=1,2. 
As the next step determine a pair of solutions A;™, A2™ of the system 
(1.6) laa ain, aix||, +=1,2 


where the moments of the dependent variable on the right have been 
replaced by the moments involving 23. 

It will be found that the first two equations of the system (1.3) are 
satisfied by the three coefficients 


By — tA,, B® — tA, ¢t 
for any value of t. Thus the two following equations 


an” (By — tA 1) + 32) (Be? ~— tA,“?) + ay3t = ayo 


1.7 
9) an (By — tdy™) + aoe (Bo — tAe) + ast = aro 


will be satisfied for each sample, for any value of t. Then the quadratic 
curve, for any constant value of t, 


u(t) = By) — tA, + (By — tAs)a + tx? 


will have linear terms which will be the optimum terms in the least 
squares’ sense for each s for the sample in question. This means that 


(1.8) aS®/a(B© ick tA,) _— dS® /d(B, — tA,“)) = 0 


where S“) denotes the sum of the squares of the residuals for the sth 
sample. 

Thus our problem will be solved if ¢ can be chosen to give an opti- 
mum value for all the samples. This optimum is taken as the value 
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which will minimize the sum of the squares of the residuals over all the 
samples lumped together. 

Now for a single sample the total derivative of the sum of the 
squares of the residuals with respect to ¢ can be shown to be 


(1.9) dS) /dt = — 2(Y3 a“ g3t) 
where 
qs = Q33 _— Aya, —_ Az azo 


(1.10) 
Y; _ A390 acs Bag, on BL az.™. 


Using (1.8) this is proved as follows: Omitting the superscripts, write 
u(t) = P(t) + zQ(é) + tz’, 
P(t) = B, — tAi, Q(t) = B. — tA: 
and the total derivative of S as to ¢ is given by 
dS 0S dP dS dQ a 


| 


dt oP dt 0Q dt a 


where the last partial derivative involves ¢ only as coefficient of 2? in 
u(t). From (1.8) 


dS/aP = d8/dQ = 0. 
Hence 
dS/dt = dS/at = 2>> (u — y)z? = — 2(>5 2’y — DS xu) 


which easily reduces to (1.9). 


Now note that (cf. (1.7)) 
— (¥) — gaz 
(1.11) (Y3 gst) 
= aa (Bi — tdi) + ag (B2 — tA2) + (asst — ago). 


Using (1.9) the derivative of the sum of the squares of the residuals 
taken over all samples is given by summing the expression on the right 
over all samples with ¢ constant. Setting this equal to zero 


Y; + Y,(2) 4+ Y,(2 
ga + gs + gy 
Hence the requtred coefficients of the quadratics (1.1) are given by 
(1.13) a) = B® — 7A,, B® = Bi) — yA, 
where is determined by (1.12). 





(1.12) fT=7= 
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II. ANALOGY WITH METHOD OF ORTHOGONAL POLYNOMIALS 





Light is thrown on the problem of finding the sampling variance of 
y by bringing out the orthogonal relations inherent in the method. The 
method of “orthogonal polynomials” would proceed by first determin- 
ing orthogonal functions ¢; of the variables z; (see (1.2) above). The 
requirement is that 


(2.1) S¢id2 = Sdids = Sd2¢3 = 0 


where the summations are taken over the N values of the independent 
variables 2;; which correspond to the data series on y; in a particular 
sample. The author has shown [6, Sec. 3] that a set of such orthogonal 
functions may be taken as 


(2.2) oi = 41, 2 = 22 — Ma, gs = 23 — Ase, — sia2e 
where 
(2.3) M = ay:/ay 


and A, and A: are defined by (1.6) above, for a single sample. 
The solution is sought in the form 


(2.4) U = tidi + tehe + tabs 
and the normal equations for a single sample reduce to 
(2.5) tiSgo:? = Sdiy, tao? = Sooy, — ts Sa” = Sosy. 


An examination of the summation products S¢;? and S¢3y shows that 
for a single sample 


(2.6) S¢3? = Qs, Sdsy = Y3 


where g3 and Y3 are given by (1.10). This is proved in the general case 
in [6], formulas (4.5), (4.6) and discussion of (3.2). 

Hence if one denotes by ¢;® the orthogonal functions applicable to 
the sth sample, one can write 


Sosy ote So3y ok So3y® 
(2.7) Y= 





q3‘?) + q3° + qs) 


where the denominator involves only the independent variables 2;. 
Thus one can proceed to find the sampling variance of y as follows: 
The first order variation in y due to variations in y; is given by 
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(2.8) by = [Sos dy + Sos dy + Sos dy ]/Q 


where for brevity Q denotes the denominator of (2.7). Making the usual 
assumption that the y; are independent with constant variance, 


E[(5y)?] = E[(sy)?][S(os)* + S(ba)? + S(ga‘ ®)*]/Q? 
which by (2.6) reduces to 
E[(5y)?] = E[(sy)*]/Q. 
Hence the estimated sampling variance of y is given by 
(2.9) Est. Variance of y = V/(q3\ + qs‘ + qs‘) 


where V equals the sum of the squares of the residuals divided by 
(N®+N®+N®—7). This last denominator represents the proper 
number of degrees of freedom since it is the total number of elements 
in the three samples (see (1.1) above) decreased by the total number 
of coefficients determined in the fitting process. 

The above method can be applied equally well if different statistical 
weights are assigned to the observations on the dependent variable. The 
only difference in procedure would be that the moments would be 
weighted moments, and that one would deal with a wetghted sum of the 
squares of the residuals. The formulas (2.1), (2.5) and (2.6) would in- 
volve weighted sums of the orthogonal functions, but the determination 
of the explicit form of the orthogonal polynomials (2.2) would remain 
unchanged since the weighting process would be included in the pri- 
mary moments which determine M, A; and Az. 

Problems of the type discussed in Sections I and II might very well 
occur in many other fields of application of least squares’ theory. The 
essential characteristics of the problem are: A determination of a set 
of coefficients of fitted functions so that they separately give optimum 
fit to each particular sample; a determination of another coefficient so 
that it gives optimum fit to the group of samples lumped together. 

The procedure can be further generalized to apply to two or more 
sets of coefficients, where there is more than one coefficient in the 
second set, and to cases where the grouping of samples is performed 
more than once. This generalization involves some changes in pro- 
cedure and may be treated by the author in a later paper. 


III, SOLUTION BY MEANS OF THE DOOLITTLE ALGORITHM 


The Doolittle algorithm supplemented by columns used for comput- 
ing the elements of the reciprocal matrix, furnishes an efficient means 
of computing the necessary quantities in the preceding problem. 
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We use a slight modification of Dwyer’s notation and first write the 
moment matrix supplemented by the array for computing elements of 
the reciprocal matrix, for three variables. This is termed the reference 
matriz in Schedule I below. This is followed by the computation algo- 
rithm where for brevity Dwyer’s “secondary” subscripts have been 
omitted from the b’s and the g’s, and a zero subscript has been used to 
denote the dependent variable [2, pp. 99-118 and 170]. 


SCHEDULE I 
Reference Matriz 


23 28 
ais a3 
ae: ass 

as 


Computation Algorithm 





aio au aiz ais 


bio 1 bis bis 





920 922 92s 0 
bso 1 bes ow 





S(III) Js Jxs 1 
III’ bso 1 — 


For the purpose of determining the solution in terms of orthogonal 
polynomials it is not necessary to carry the primed rows through the 
auxiliary part of the computational schedule. The orthogonal poly- 
nomials are given explicitly by the elements in the summation rows 
S(I), SWZ) and S(III). Thus 
(3.1) oi = 41, go: = 22 — Ma, os = 23 — Ajzi — Ase. 


The multipliers which apply to these orthogonal functions are the b’s 
of the first column. Thus the fitted function is 


(3.2) U = diodi + boob: + bods 

and for 

(3.3) u = hz, + hoz, + h3zs 

the forward solution for the three variable fit is 
hy = bio — Mbao — Aibso 

(3.4) h,® = beo — Asbso 
hs® = bso. 





326 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1953 


For two variables omit the third column 

It has been shown [6, formula (4.5) and discussion of (3.2)] that the g 
elements in the summation rows (not including the auxiliary schedule) 
are related to the orthogonal functions as follows: 
(3.5) gro = Sy¢r, Grek = Sribr, 9rr = S¢,? 
where summations are taken over range of observations fitted. 

Thus from (2.6) 
(3.6) gs = Sos? = gs, Y; = Syds = 930. 


Hence the solution sought, in terms of the notation of the Doolittle 
algorithm is 





- gso? + gao?? + gro 
7 ga + gas?) + gaa) 


where the g’s are determined from separate Doolittle algorithms carried 
through for each sample, and 


a = bio) — Mb — yA, 
Bo = boo? om vyA;, s= 1, 2, 3 


(3.7) 


(3.8) 


give the first two coefficients for each sample, adjusted for +. 


IV. ADVANTAGES OF COMPUTING THE ORTHOGONAL FUNCTIONS 


We have seen in the last section that the explicit determination of a 
set of orthogonal functions inherent in the least squares’ solution is 
possible by supplementing the array of moments used in the reference 
matrix of the Doolittle algorithm by the identity matrix (with diagonal 
elements equal to unity and all other elements equal to zero). This proc- 
ess is entirely general and will serve to determine a set of orthogonal 
functions for any number of independent variables or “predictors.” 

One may well ask why it is not desirable to complete the computation 
for the elements of the reciprocal matrix. In some cases this would 
probably be desirable, for example in a problem of linear correlation 
where the number of predictors is fixed [4]. However, in many cases 
one is experimenting with the question of what predictors are suitable, 
and hence a process by which new variables can be introduced or 
dropped with a minimum of computation is highly desirable. As we 
shall presently see, the computation of the coefficients of the orthogonal 
functions offers an efficient means for such a process of iteration or 
“build up.” 
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We introduce here a general notation. The general problem is taken 
as that of fitting a linear function u of N observations on a set of n 
variables 2; (see (1.2) above) and the matrix of the normal equations is 
given by (1.3) with no superscripts since we deal only with one sample. 
The fitted function is written 


(4.1) Un = Biz, + Boze + B23 +--+ + BM2q 


where the superscript here denotes the number of predictors used in the 
fitting process. Thus if only three are used, 


Us = Biz, + Boze + Ba zs. 


The same function expressed in terms of the orthogonal functions is 
written as 


(4.2) Un = tigi + tebe + babs +--+ + taba 


and in this case the solution for three variables is given by using the 
first three terms of the same series. 

Denote by a; the coefficients which determine the orthogonal func- 
tions. Thus 


fi = %1 
g2 = 22 — az, 
(4.3) os = 22 — a2, — 


Gn — Z. nie ay"-Dz, om ag*—Yz, — e«cc = an ™—Y2,. 
From the above it follows that 
By” ss ty — tea) = tsar ‘2? meee «= tna *—) 


(4.4) ° 


ba—1 — baQtn—1 
tn 


where the multipliers ¢; are the same as the b elements taken from the 
first column of Schedule I. Using the abbreviated Dwyer notation, 


(4.5) ti = Dio, te = bao, +++ , tn = Dno. 


We ave seen in the last section that the coefficients of the orthog- 
onal functions for the first three variables are given by the elements in 
the “summation” rows of the auxiliary schedule of the Doolittle algo- 
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rithm. This is known to be true in the general case [6, Sec. 3]. Hence in 
terms of our general notation the elements in the summation rows of 
this auxiliary schedule appear as 
SCHEDULE II 
Auziliary to Doolittle Algorithm 


Row a 22 23 2n 

S(I) 1 0 0 o 0 

S(II) —ay,) 1 0 see 0 

(4.6) S(III) —a,® —a,® 1 : 0 
S(n) —a-D ag") a, (*-) 1 


The relationships (4.4) to (4.6) offer precisely the “forward solution” 
suggested by Dwyer [2, p. 170]. In other words, if one uses only two 
columns in the array (4.4) the solution of the two variable problem is 
obtained. Addition of the third column gives the three variable solu- 
tion, and so forth. 

If one uses the fact that the ¢’s are the multipliers of the orthogonal 
functions in the solution (4.2), one realizes that the ?’s are statistically 
independent in the sense of having zero covariance over the range used 
in fitting. Assuming no errors in the predictors, the sampling variance 
weight of ¢, is known to be the reciprocal of the antecedent gi (see 
Schedule I). Hence the variance weight of any coefficient 8, of the 
n variable problem is given by 


Variance Weight of 8: = 1/gee + (ax )?/gestepi t-°° 
+ (arn —”)*/gnn. 


The advantage of this formula is that to find the variance weight of 
the coefficient 6,"—» of the same predictor, when another prediction 
variable is added, one merely adds the term 


(4.7) 


(on™ )?/Qntant- 


A similar formula can be set up by the researcher for the covariance of 
two coefficients. 

As more predictors are added, the variances of all coefficients 6; 
are tabulated and partial correlations can easily be found from the 
following relation. Denote by S” the sum of the squares of the residuals 
after fitting a linear function of n predictors. Let S,““—- denote the sum 
of the squares of the residuals when the kth predictor z, is omitted in 
the fitting. The following equation then holds [1, p. 163). 


(4.8) S.-Y) = S + (6,™)? divided by Variance Wt. of 8, . 
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From this the partial correlation of y relative to z for the n variable 
solution can be computed from the relation [3, p. 180] 


(4.9) 1 = roe ar-n = [S™/Sp"- (N — n — 1)/(N — 2). 


The above analysis does not explicitly involve the orthogonal func- 
tions and thus does not involve new theory. The results are obtained 
simply from a study of the relations (4.4) above and constitute an 
argument for the computation of Schedule II. 

An explicit result following from the fact that the computations sum- 
marized in Schedule II give the orthogonal functions is that in the case 
of the linear problem the orthogonal functions furnish a new set of 
predictors which are linear transforms of the original variables, and 
possess zero covariance over the range of observed values used in the 
fitting process. In this connection it should of course be recognized that 
these orthogonal functions depend upon the order in which the original 
variables have been considered. For example, if z, were taken as the 
first variable in the fitting process, we should have ¢;=z,. Hence any 
change in the order in which the predictors are considered, means a 
change in the corresponding set of orthogonal functions computed by 
the above method. 

Knowledge of the explicit nature of these “orthogonal predictors” 
may throw light upon the nature of the original variables used. Indeed, 
in researches in psychology and economics it has been found that the 
determination of such “principal components” offers a starting point 
for simplifying inherent structural relationships. 

Analysis by means of orthogonal functions and their relation to the 
Doolittle algorithm makes it much easier to generalize the type of prob- 
lem discussed in Sections 1-3 and offers a solution which can be carried 
out in a systematic manner. 

Perhaps the most obvious application is to problems of fitting of 
polynomial curves which are to be used for purposes of forecasting. In 
such cases it is often desirable to estimate the variance of the fitted 
polynomial at several points on the curve for the purpose of determin- 
ing a sampling confidence band. 

For curves of any complexity, say of the third or fourth degree this is 
simpler to do in terms of the orthogonal functions implicitly used in the 
fitting process. Recalling that the reciprocals of the gi, elements in 
Schedule I constitute the variance weights of the multipliers t; of the 
orthogonal functions, which are now orthogonal polynomials of a single 
variable, say z; the estimated variance of the fitted polynomial u at any 
point 2; is approximated by (cf. [5], formula (21)) 
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Su? = Valoi?/au + o2°/ge2 + +++ + bn?/Gnn] 


Vn = S™/(N — n) 


applicable to a polynomial of (n—1)st degree. 
With the orthogonal polynomials determined by Schedule II, where 
now 


4: = 1, 22 = 2, 2g = 2, +++, Zn = Zr, 


their values at a series of points can be systematically computed with 
considerable ease, and the resulting numerical values are then squared. 
There remains merely to divide each one of these n values by the corre- 
sponding gx, taken directly from the Doolittle algorithm. The sum 
gives the variance weight s,?/V,. If the degree of the polynomial is in- 
creased by one, the new variance weight involves simply the addition 
of the term 


Gn4t?/Gnttnt 


to what has already been computed. 

The other alternative is to find the covariances and variances of the 
coefficients 6," by completing the computation of the reciprocal ma- 
trix and apply them to squares and cross products of the original varia- 
bles at each point. This is not only a much more complicated process, 
but also it will have to be repeated from the beginning, if a polynomial 
of another degree be fitted. 


V. NUMERICAL EXAMPLE 


As a numerical example we have used a problem presented by Snede- 
cor to illustrate the method of orthogonal polynomials [8]. Data are 
taken over 11 consecutive days on weight of chick embryos and a suc- 
cession of polynomial curves to the fourth degree are fitted. We shall 
use the same data with the exception that we shall suppose that in- 
advertently no weight was recorded on one of the days (taken as the 
11th day which happens to be at the central position). We shall use a 
weighted fit with statistical weights equal to unity assigned to days 
when weights were recorded, and a statistical weight of zero assigned to 
the 11th day when no weight was recorded. 
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Data to be Fitted 


Day: 6 7 8 9 10 11 12 13 14 15 16 
y: 29 52 79 125 181 — 425 738 1130 1882 2812 
z: 10 9 8 7 6 5 4 3 2 1 0 


This procedure enables one to use “factorial moments” in setting up 
the matrix of the normal equations. We assign values of x such that 
z=0 on the 16th day, at the extreme where y is large, and thus x= 10 
corresponds to the 6th day which is the first day that an observation 
was recorded. Thus there is no value of y at 2 =5, and statistical weight 
w=0 at z=5. 

We use Lipps factorial moments [7] which might be termed “re- 
duced” factorial moments or “reduced cumulatives.” They represent 
successive cumulations from z= 10 towards x =0 where each successive 
cumulation omits the last subtotal in the previous cumulation. The 
moments of the weighted independent variable z are 


S=d)w=10,, S= r(*) = 50, 
Zz 
S: = xw(*) = 155, ete. 


In the case of the dependent variable u the moments are 


Mo = > wy = 7.453, My = Dw(*)y = 11.407, 


x 
M:; = Du(F)u = 16.623, etc. 


Since the major interest is in the plotting of the curves, the matrix of 
the normal equations is set up to determine successive differences of the 
polynomials at x=0. We fit 


ts = Bi) + Br (2) + B® ey + B® (*) + B® (;) 


with 


4i=1, z = (z), a-(¢), a=(*), a-({). 


The moments involving the dependent variable in the matrix are then 
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a= Dw,” 


The moments of the independent variable in the matrix are 


Zz Zz 
” las ’ k,t=1,2,-++,5. 
ax rw," G2 ) 1 5 


“EMCEE 
OMENS) 
+723 , we )°* 


with rSs, the matrix elements a; are easily resolved in terms of the 
reduced factorial moments S;. For example 


Qi =— a1 = Si-1, au = 4S, + 30S; ot 60S. +. 3582, ete. 


Substitution of the values of the reduced factorial moments S; yields 
the following matrix for the normal equations: 


Jy = as, k=1,2,---,5. 


y 22 23 % 2 
7.453 50 155 320 457 
11.407 360 1270 2788 4133 
16.623 —_ 4817 11054 16912 
22.705 —_ _— 26234 41233 
25 .687 —_ _ _ 66327 


Following the computational procedure outlined in Schedule I, with 
the identity matrix placed at the right of the normal equation matrix, we 
record in Schedule III the numerical values found in the “summation” 
and “primed” rows of the computational algorithm. 

The formula for the sum of the squares of the residuals, to be used in 
the forward solution is (cf. Schedule I and [1, p. 164]) 


S® = >) wy? — arobio — geobso — +++ — geods0 
= 13.509609 — 13.506125 = .003484 
and the variance of fit is 
Vs = .003484/5 = .000697. 
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The forward solution for finding the 6’s from the relations (4.4) and 
(4.5) would be facilitated by a separate computational schedule which 
we do not show here. This would be obtained from Schedule III by 
multiplying the elements by in the first column corresponding to the 
primed rows by the figures which appear in the preceding “summation” 
rows obtained from the identity matrix (Cols. 8 to 12). If a long compu- 
tational sheet is used, this might be placed to the right of Schedule III 
to avoid copying the multipliers bis. 

One can summarize the numerical results as follows: The values of 
B;® give the differences at z=0. They are 


us(0) = 2.822632, Aus = — .975823, A’us = .2979619 
A*us = — .0702948, A‘us = .00947051. 


The orthogonal polynomials implicit in the solution defined by the ele- 
ments in the summation rows of the auxiliary part of Schedule ITI, are 


¢1 = 1, o: = z — 5.0, = (5)-Go2+7 


a-(5)-1(3)# 00-4 


oo (*) — (3.5) (5) 4 (5.011765) (* ) 


— (5.60294)2 + 2.68235 
which means that the quartic can be expressed as (see (4.5) above) 
Us = tigi + taba + babs + lads + bods. 


Subtotals obtained from the forward solutions of the §’s give the 
differences of the intermediate polynomial curves fitted to the data. 
These are 


Straight Line: uw, = 1.920665, Au, = — .235073, 

Quadratic: us = 2.574341, Aus = — .655294, Au; = .0933824, 

Cubic: us = 2.797229, Aus = — .922760, A2um = .2419744, 
A*u, = — .03714802. 


Using (4.10) and the above computations the first approximation to 
the variance of the fitted quartic at any point z is given by 
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8u? = (.000697) [0.1 + [¢2(x) ]?/110 + [¢s(x) ]*/187 
+ [$a(x) ]2/(171.6) + [bs(x) ]?/(60.1446) ]. 

The above set of orthogonal polynomials will of course be valid for 
any series of y data arranged at 10 equally spaced intervals with no 
value recorded at the central value of x, where x =0 at one extreme and 
z=10 at the other extreme. Because of the simple nature of the dis- 
tribution of the statistical weights over the scale of x; it may be of in- 


terest to record the orthogonal polynomials more in detail. The exact 
values of the last three coefficients of ¢5 are 


5.911765 = (100.5)/17, 5.60294 = (95.25)/17, p2-88235 = (45.6) /17. 


Lg * 

Since the weighting is symmetrical about the central value of z, x=5, 
the orthogonal polynomials will satisfy the usual symmetrical relations. 
They can be written as: 

gd: = (x — 5), 2¢3 = (x — 5)? — 11, 

6d, = (x — 5)* — (17.8)(z — 5), 
1544.4 
17 


443 
2465 = @- tf --+¢ — 5)? + 
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ESTIMATION OF THE PARAMETERS OF TYPE III 
POPULATIONS FROM TRUNCATED SAMPLES 


Des Ras 
University of Lucknow and Indian Statistical Institute, Calcutta 


The problem considered is the estimation of the mean and 
variance of Type III populations from singly and doubly 
truncated samples with known truncation points when the 
number of unmeasured observations is (i) unknown for each 
tail, (ii) known separately for each tail, (iii) known jointly for 
the two tails. For singly truncated samples, estimation of all 
three parameters has also been considered. Estimating equa- 
tions are obtained by the method of moments as well as by 
the method of maximum likelihood. Previous results for the 
normal and Type III populations are particular cases of the 
results obtained in this paper. Numerical examples are given. 


HE problem of estimating the mean and variance of normal popula- 

tions from truncated samples has been considered by Pearson and 
Lee [10], Lee [9], Fisher [7], Stevens [12], Cochran [1], Hald [8], Cohen 
[2] and [4], and the author [5]. The corresponding problem for the Type 
III population, of which the normal population is a particular case, is 
considered by Cohen [2] for singly truncated samples when the number 
of unmeasured observations in the omitted portion is not known. The 
object of this paper is to study Cohen’s problem when the number of 
unmeasured observations is known and to estimate the mean and vari- 
ance of Type III populations from doubly and singly truncated samples 
with known truncation points when the number of unmeasured ob- 
servations is (¢) unknown for each tail, (it) known separately for each 
tail, (477) known jointly for the two tails. 


I. THE PROBLEM 
The Type III population may be written as 


20 
(1) f(x)dz, p-—Sz< oO, 
as 


where 


a3”)— 
(2) f(z) = <h1 + =(- ne ‘\] nels ie tlade-wle 
Co 


o 
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and 
© = (4/ag2) 4/28") 1/2941 03° [114 /aig2) J“, 


The mean, u, and the variance, o?, of this population are unknown 
while in the first instance a3, the third standard moment is supposed to 
be known. Let ao’ and 20’ =20’'+R be the left and right truncation 
points respectively. Let there be n; unmeasured observations in the 
tail to the left of xo’, nz unmeasured observations in the tail to the right 
of zo’ and mo measured observations in the sample range. We shall 
estimate the mean yz and the variance o” in the three cases: (4) when 
and mz are not known, (i) when 7m; and nz are known, (ti7) when 
m+n2=N—no is known. 


II. ESTIMATION BY THE METHOD OF MOMENTS 
The truncated population may be written as 
(3) S(z')dz’, OS2' SR, 


where 


(4/a3*)—1 
(4) f(z’) = e[ + 3 (2’ + ‘| e~(2/as) (2+ 8), 


and 


, 
Zo 
Poe cl 

—€& = 2’, 


(5) 
=e +—. 
g 


It is easy to see that f(z’) satisfies the differential equation 


(6) _ = ix!) oo (G ‘) ve 


fle’) ax’ o.\, a, 
(14+ S¢)+3: 





Case (t). To obtain the moments of the truncated population about 
Zo’, we separate the variables in (6), multiply both sides of the resulting 
equation by 2z’* and integrate over the range of the truncated popula- 
tion. Putting r=0, 1, the first and second moments are given by 
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‘in = | - + 0(Z - 2) - = ~2,| 
ma € g 1 2 2 6 21>» 


wo den oft + et4 (= ii :) (Zi — Zs) 


tafe 3(34)} 
L o ' 2\2 o 








where 
f ere -1 
a = s(¢)| ftoat| | 
’ 
(8) ” s 
Z2 = ” ' 
Za = 108" f : fiat] 
and 
a3 7) (4/a3)—-1 
(9) {= ef + =] e-(2las)t, 
(10) @=1+ = t’. 
Let 
(21) = > (x3 — 2o’)*/no 


be the rth moment of the sample of no measured observations about 
2o’. Equating the population and sample moments, the estimating 
equations obtained are 


R 
y= “| - &’ + 0(Z; — Z2) ose = 2a], 
2 «6 


(12) V2 


oft + e+ (= “ :) (Z, — Zs) 


-£afienet)} 


These equations in é*’ and o* can be solved by Newton-Raphson method 
Here Z, and Z; can be calculated from Salvosa’s tables [11]. Of course, 
u* is given by 


1953 


out 
ing 


od 
se, 
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(te! — w*)/o* = & 
Case (ti). The first two moments of the left tail about zo’ are 
_ o| e+ o™y,], 


nN 


o-(G-) (e+) 


while the first two moments of the right tail about 2’ are 


as R No 
M, = of - ¥+(0+2 =)" v4], 


a/ne 


m 
(13) 


m 


(14) 
a3 R a3 R No 
MM: = ab + §+ (« +— ~)(= — ’ + =)" rs], 
20a 2 c/N 
where 
ny e me 
rs = yey] sioae] 
(15) 0 —2/a3 


Ne - i 
va = ye sibat] 
No ge"! 


Considering m, mz and M,, M, as the contributions per unit observa- 
tion of the left and right tails towards the first and second moments 
respectively, the modified moments of the total sample of mo+-71-+72 
observations about xo’ may be written as 


vy’ = (nym + non + 2M) /(n + mo + n), 
vo! = (mym2 + Nove + M2M2)/(m1 + Mo + Ne). 


The moments of the complete population about 29’ are given by 


(17) sales 
2’ = (1 + &”*)o". 


Equating the population moments to the corresponding sample mo- 
ments, the estimating equations obtained are similar to (12) where 
Y, and Y: are substituted for Z,; and Zz respectively. 

Case (iit). The first two moments of the combined tail about xo’ are 
given by 


n=e[-¥-0 


(16) 


No 








as R No 
%,- %)+——- XI, 
(% y+ ¢ N—MN | 
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R as R No 
o 2 o 


™3—1 


e {1 4 2 (= i _— ~ (Xi Xi) 


N- rope ~ 
X m 4(e!) f soat [ semat 
n Lo —2/as a 


’ 


(19) : 


34-1 


N — 47 : Jf 
X, = ——" ee" J, soar f. fib 








The moments of the total sample of N observations about 2’ are given 
by 

= [non + (N — no)gil/N, 

= [nove + (N - no)g2|/N. 

Equating the sample moments to the population moments (17), we 


obtain estimating equations similar to (12), where X, and X: are sub- 
stituted for Z, and Z, respectively. 


(20) 


III. MAXIMUM LIKELIHOOD ESTIMATION 
Case (t). The likelihood function of the sample may be written as 


K gre 
n= EL f roa] “eo #(24 2) 
ono t’ 


no ” (4/a3*)—1 
Tif1+ S(=-+¢)] 
1 Z2X\¢ 


to’ — pb xo’ — p 
(22) 2” =2— 2’, wae, e! = — , k = constant, 
Co Co 


(21) 


where 


and f(t) is given by (9). Taking logarithms of (21) and differentiating 
partially with respect to ¢’ and o and setting to zero, the equations ob- 
tained are 


(17/4 2 
~(— - 1)S8-—+ (- 2 = 0, 
No as” 2 3 


(23) 
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where 


(24) 8 = z[1+ af +e]. 


and Z,, Ze, v1 given by (8) and (11) respectively. From these we obtain 
the maximum likelihood equations 


: a3 R 
-| - t’ + 6(Z, _— Z2) = 2 = 2, — & = 0, 


~(=- 2) S[o+ 2 =] pe eee 


No \a@ 2 1 2¢ a3 


(25) 


These equations in ¢’ and é can be solved by Newton-Ralphson method. 
Case (it). The likelihood function of the sample in this case may be 
put as 


I, = =[ fo] | Sloat] "eer 
ES + ape 


Proceeding as in case (7), the estimating equations obtained are similar 
to (25), where Y; and Y; are substituted for Z; and Z: respectively. 
Case (t7t). In this case the likelihood may be written as 


K 4 N—ng no / y!! 
L; = _ | J S(t)dt + J. “Koa oon x(= + :’) 
no ” (4/a3)—1 
fif1+ 2(— + )| - 
1 2\0¢ 


Proceeding as in case (1), the extimating equations obtained are similar 
to (25), where X; and X; are substituted for Z; and Z: respectively. 


IV. THE SINGLY TRUNCATED CASE 


The results for the singly truncated case can now be obtained in 
particular. In case truncation is on the left only, 10’. 
So that 
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f ¢”’ and R—> o, 22, RZ, Xe and RX, —— 0; 
Z1— dé, X1—t(é"), where 


(4/a3?)—1 
A(é’) = E + | em (2/aa) &’ 


¢’ as (4/a3?)—1 -1 
[ (1 oe =) eoimora| . 
—2/ a3 2 





m1 
#8") = — X(¢’). 
No 


Making relevant substitutions from (27) in case (itz) of Section II, we 
obtain the following equations of estimation by moments, when 7, the 
number of unmeasured observations in the left tail, is known. 


n = o[ot(e’) — &’], 
28 
” ve = 0? {o + |= ~ | [oe(é’) — el} 
To solve these equations, we eliminate o to obtain 

V2 1 6 a3 
29 —= ——-’|. 
” Saves PT * | 


Corresponding to any ¢’, the right-hand side of (29) can be evaluated 
from Solvosa’s tables of areas and ordinates of the standardised Type 


III function. Hence, for a given sample ¢’ is determined from (29). 
And then 





o* = y,[0*t(e*’) — J“, 
(30) * , */ 

pb" =X — Eo". 
Again, substituting in case (777) of Section III, the maximum likelihood 
equations for this case are 


2 

1)8 -~ +4) = 0, 
a3 

(31) 4 2 VY 

-=+-(+- 1)¢s+— =o, 


as” No \ as? a3 ¢ 


It is easy to see that ?’ is given by 
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=) xf +2 = lore’) — vl} 4 
2 


a 


+ t(é’) = 0, 


(33) & = »,[6t(2’) — #], 
and 
(34) jp = 20! — £6. 


In case the number of unmeasured observations in the left tail is not 
known, we make relevant substitutions in case (t) of Section II and in 
ease (t) of Section III and obtain equations by the method of moments 
and by the method of maximum likelihood respectively. The manner of 
solving these equations is the same as before. 


V. ESTIMATION WHEN az IS NOT KNOWN 


We shall consider the singly truncated case when, m1, the number of 
unmeasured observations in the left tail is known. The third moment of 
the left tail about x0’ is given by 


2 
(35) ms = o| ~~ 0(2 +e 4 = - Sr/)awy]. 


and the third modified moment, »3’, of the total sample of no+-7 ob- 
servations about 20’ is given by 


vs’ = (nym + Novs)/(Mo + M1), 
while the third moment, y;’, of the population about 2p’ is 
(36) us’ = (a3 — 3’ — &’*)o%. 


Equating the first three moments of the population to the correspond- 
ing modified moments of the sample, the equations of estimation are 
(28) and 


y= 0 {6 — &’) 
(37) 
+ [ot(é’) — ¢] |= (as — &’) + 2+ 2|\ ; 
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To solve these equations, we estimate o to obtain the following. 





ve ‘ a 1 6 = ’ 
et = Q(&:’a3) = oe) — sla =P" 5 |. 





V3 ’ as 1 
(38) ry = P(&'a3) = [owe — &']2 


{ e+ 2+ ( "| a. |} 
” os ae’) — #2 
From these equations we obtain ¢*’ and as* by Newton-Raphson method. 


In cases, however, when truncation is on the right, the equations of 
estimation are 


(nm = o[oT(é’) — &’], 
nad {0 + (= - ') [oT (e’) - vl} 


ot fo(ea - &) 





+ [= (as — &’) + &% + 2| [oT (é’) — 2) 


(439) 
o= 2° ’ 


(4/a37)—1 


T(t’) = =[1 = | elas! 
No 2 


C) as (4/a3”)—1 -1 
{f (1 on =) eaten , 


For a given ¢’, T(é’) can be calculated from Solvosa’s Tables. The man- 
ner of solving these equations is the same as discussed above. 


(41) 


VI. EXAMPLES INVOLVING SINGLE TRUNCATION 


(a) Method of moments. The method of estimating the parameters in 
this case can be best illustrated by considering the example given by 
Cohen (1950). This example was constructed by arbitrarily truncating 
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a sample distribution of the weights of 629 University of Washington 
Freshmen so that the record of observations less than 110.5 Ibs. is 
missing. It is known that there are 15 observations in the omitted por- 
tion of the distribution. For the truncated distribution we have 


mo = 614, m=15, 2’ = 110.5, = 32.6596, 
vp = 159.2798, vs = 67269.15. 


Case (I). as is known. In this case it is given that a;=0.6. The equa- 
tions in ¢’ and o are 


32.6596 = o[at(t’) — &], 
1359.2793 = o*{@ + (.3 — £’) [at(é’) — é’]}. 

The equation to determine ?’ is 

[ 6 

at(t’) — &’ Lat(é’) — &’ 
For a first approximation for ¢’, we read from Salvosa’s Tables the 
value of ’ such that the area of the standardised Type III curve from 
—2/a; to ¢ is 15/629=.023847. Then a first approximation is 


t’ = — 1.68. Details of further computations for determining é’ are given 
in the table below. 





1.27434 = Q(t’) = 


+ 3- |. 


TABLE 1 


COMPUTATIONS FOR Q() 
as =0.6 








ra I(t’) Ae) “(e’) “od +06) Q(e) 


a 
(8) — 8 





(1) (2) (3) (4) (5) (6) (7) (8) 





—1.68 -023402 -089266 3.814460 .093187 -579300 2.267333 1.31347 
—1.75 -017741 -072787 4.102756 - 100230 - 556295 2.314240 1.28740 
—1.79 -015003 -064209 4.279744 - 104554 -543949 2.341848 1.27385 
—1.78 -015656 -066295 4.234479 - 103448 -546984 2.334895 1.27715 
—1.789 -015068 -064418 4.275153 - 104442 -544251 2.341152 1.27417 





Thus ¢’ = — 1.789. From (30), o* =17.775 and p* = 142.30. 
Case (II). as is not known. In this case, the equations in a; and ¢’ are 


1.27434 = Q(E, os), 1.93101 = P(E’, as). 


The third standard moment as obtained from the truncated sample 
may be used as a first approximation for a3. Thus, a3=.7 for a first ap- 
proximation. A first approximation for ¢’ is obtained on the assumption 
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that the area of the Type III function from —2/as to ¢’ is 15/629. This 
gives ¢’=—1.62 from Salvosa’s Tables. Detailed computations in- 
volved in the various steps to solve the equations by Newton-Raphson 
method are given in the tables below. 


TABLE 2 
COMPUTATIONS OF Q(é’, as) AND P(@’, as) 








, 1 
Ti(é’) Se) re’) ee) ue) -F Q(t’, ax) P(E’, an) 





(3) (4) (5) (6) (7) (8) (9) 





-106606 4.076244  .099583  .608340 1.34910 
-054564 5.280046. -541205 1.27197 1.93773 
-056727 5.208613 .127246 . 1.27523 
-052450 5.352041 . - 538236 1.92641 


-066295 4.234479 . -546984 1.27715 1.94147 
-064209 4.279744. -543949 1.27385 1.93003 
-064418 4.275153. 544251 1.27417 =1.93117 





TABLE 3 
DIFFERENCES OF ¢’ AND as 








t’ from Q(t’, as) t’ from P(é’, as) Difference 





—1.793 —1.806 .013 
—1.789 —1.789 .000 





We thus find that a;=0.6 and ¢’ = —1.789. Consequently, the esti- 
mates of u and o are the same as obtained in Case (I). The results ob- 
tained by Cohen (1950) for the case when the numbers of unmeasured 
observations is unknown, are shown in the table below. 


TABLE 4 
ESTIMATES OF THE PARAMETERS 








Values from Estimates obtained by Estimates obtained in 
complete Cohen this paper 
sample a;known a; unknown a;known a; unknown 


Param- 
eters 





0.59 0.60 0.71 0.60 0.60 
—1.805 —1.850 —1.878 —1.789 —1.789 
19.59 17.43 17.28 17.775 17.775 
142.25 142.75 142.95 142.30 142.30 
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(b) Method of maximum likelihood. The method of obtaining the 
maximum likelihood estimates is best illustrated by taking a random 
sample from the Type III population with a;=0.6, u = 100.00, « = 10.00, 
truncated at 90.00. The random sample is 


94.92 96 .06 103 .93 96.15 
100.93 104.53 105 .94 94.62 
90.88 100 .67 93.46 103 .43 
93 .26 131.28 94.11 103 .84 
92.39 90.22 90.49 91.57 
99.59 94.42 95 .42 102.71 


For the sample selected, we have 
nm = 30, m=5, m = 9.373, ve = 172.74529, 2x’ = 90.00. 


On the assumption that the area under the standardised Type III 
curve with a3=0.6 from —2/as to ~’ is 5/35 =.142857, a first approxi- 
mation for ¢’ from Salvosa’s Tables is given by ¢’ = 1.04. Beginning with 
this value we finally obtain the following table. 


TABLE 5 
COMPUTATION OF R(é’) 





e’ S 





— .75 30 .916744 
— .80 28 .811959 
— .76 29 . 880896 





Hence, ¢’ = —.755 by interpolation. 
From (33) and (34), é=9.8198, 7=97.4139. The estimates obtained 
by the method of moments are 


g*’ = — .75, o* = 9.8773, u* = 97.408. 


The two methods appear to give estimates close to each other. 


VII, AN EXAMPLE INVOLVING DOUBLE TRUNCATION 


The manner of solving the estimating equations in this case will be 
illustrated by drawing a random sample of size 30 from the Type III 
population with parameters a;=0.6, u=100, c=10, truncated at 
to’ =85 and 29’ = 115. Only case (1) will be considered as the method is 
the same for other cases. 
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The sample drawn is 


99 .937 101.172 112.459 109.973 99 .467 
102.483 95 .331 97 .380 95.774 95.591 
95.441 102.363 99.116 86.741 96 .028 
102.497 87.314 97 .187 88 .063 98.965 
102.807 86 .637 94.561 100.306 110.087 
110.324 105 .355 107 .434 99.955 97 .805 


For the sample selected, we have 
no = 30, vw, = 14.118433, ve = 246.292945, R = 30. 
(a) Method of moments. The equations to be solved are 


30 
P(é’, oc) = -| - ¢’ a (1 a 3’) (Zi, ~~ Z:2) = 3 = ~ 2a 
— 14.118433 = 0, 
Q(t’, o) = o? 51 + 2 + (1 + .88’)(.8 — &)(Z1 — Ze) 
30 30 
-— Z,[1 + 3 (3 + =) — 246.292945 = 0. 
o go 


As a first approximation for o, we use the sample standard deviation 
s=6.853 and for ~ the corresponding quantity calculated from the 
sample, viz., —vi/s=—2.06. After several steps we obtain the table 
given below. 
TABLE 6 
DIFFERENCES OF o AND ?’ 








o ¢’ from P(t’, o) é’ from Q(é¢’, ) Difference 





8.28729 —1.7960 —1.7963 + .0003 
8.31025 —1.7917 —1.7913 — .0004 





By interpolation we have, &* = — 1.7942, c* =8.2971 and consequently 
u* = 99.8867. 
(b) Method of maximum likelihood. The equations to be solved are 


30 
(1°) -| - &’ + (1 + .38’)(Z1 — Z2) — 3+ =z] - 14.118433 = 0, 
g 


(2°) 101111 > [1 + .3¢ +.3 =} — 3.33333 + (Z: — Z) = 0. 
g 


First approximations for o and ~’ are the same as in (a). 
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After several steps we obtain the following table 


TABLE 7 
DIFFERENCES OF « AND ?’ 








o t’ from (1°) &’ from (2°) Difference 





8.982036 —1.6739 —1.6752 + .0013 
8.902077 —i.6871 — 1.6860 — .0011 





From this we have ¢=8.9387, #’ = —1.6811, 7 = 100.0269. 

It may be noted that the two methods do not give the same esti- 
mates. These estimates, however, will be identical when a;=0, as 
shown by the author [6]. 
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PROCEEDINGS 


AMERICAN STATISTICAL ASSOCIATION 
112TH ANNUAL MEETING 


PALMER HOUSE, CHICAGO, ILLINOIS 
DECEMBER 28, 1952 


MINUTES OF THE ANNUAL BUSINESS MEETING 


The meeting was called to order by Aryness Joy Wickens, outgoing President 
of the Association. 


Report of the Committee on Elections 


R. J. Eggert, Chairman of the Committee on Elections, reported to the mem- 
bership the results of the balloting for officers for 1953. The following officers 
were elected: 


President Elect Herbert Marshall 
Vice President Rensis Likert 
Directors (1953-55) Wilfrid J. Dixon 
Margaret J. Hagood 
Representative at Large (1953-54) Kenneth Miller 
District Representatives 
Northeastern District David Votaw, Jr. 
Eastern District A. J. Jaffe 
Southeastern District Ezra Glaser 
North Central District William Madow 
South Central District John Stockton 
Western District Harry Schwartz 


Nominations for District Representatives 


Mrs. Wickens presented to the membership some of the difficulties attached to 
the nomination of district representatives. She stated that the matter would be 
presented to the incoming Board and Council. However, she remarked that it 
was up to the membership to make these nominations and that heretofore the 
aominations were late in arriving at the Secretary’s office. Since this was a mat- 
ter of interest to the whole membership, Mrs. Wickens urged that these nomina- 
tions be forwarded to the Secretary’s office early in the year. 


The Report of the Board of Directors for 1952 


Morris Hansen read the report to the membership. It was moved that the 
report be received and approved. The motion was carried. 


Secretary-Treasurer’s Report for 1952 

Samuel Weiss read the report of the Secretary-Treasurer to the membership. 
It was moved that the report be accepted and approved. The motion was carried. 
Coming Annual Meetings 


Mrs. Wickens asked Mr. Weiss to present to the membership the dates and 
places that had been decided upon for future annual meetings of the Association. 
They are as follows: 
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Year Place Hotel Date 

1953 Washington, D. C. Shoreham December 27-30 

1954 Montreal Mount Royal September 10-13 

1954 San Francisco (This is a regional meeting—hotel and dates 
not yet chosen.) 

1955 New York City Biltmore Late in December 


Resolutions 


Mrs. Wickens called upon Ralph Watkins to present the following resolutions 

to the membership: 

1. Resolution regarding the Program Committee. 

RESOLVED that the officers and members of the American Statistical 
Association express deep appreciation for the excellent program prepared 
by the members of the Program Committee under the leadership of Alfred 
Watson, Chairman, and Edward Bloom, Secretary. 

The resolution was approved. 

2. Resolution regarding local Arrangements Committee and Chicago Chapter. 
RESOLVED that the officers and members of the American Statistical 
Association express their profound appreciation to the Local Arrangements 
Committee under the Chairmanship of Wesley Mitchell and to all of the 
individual members of the Chicago Chapter for their outstanding work and 
hospitality in connection with the arrangements for the 112th annual meeting 
of the Association. ' 

The resolution was approved. 

3. Resolution regarding the retirement of Sylvia Weyl. 

WHEREAS, Sylvia C. Weyl] has ably and faithfully served the Association 
in the capacity of Executive Assistant and Editor of The American Stat- 
istician, the former from 1944 to July 1952, and the latter from 1947 to De- 
cember 1952, and has now retired from these positions; 

THEREFORE, BE IT RESOLVED, that the members of the American 
Statistical Association in annual meeting assembled do hereby express great 
appreciation of the outstanding service rendered by Mrs. Weyl during her 
terms of office, and hereby express our recognition of her valuable assistance 
so competently performed over this long term of years. 


The resolution was approved. 
Mrs. Wickens turned the chairmanship of the meeting over to Professor 
Cochran, the president of the Association for 1953. 


New Business 


Professor Cochran asked if there was any new business to be discussed. It was 
moved that the membership consider a time other than Christmas week for the 
1956 annual meeting of the Association. The motion was carried. The meeting 
was adjourned. 


Report of the Board of Directors for 1962 


During the calendar year just ended, the Association continued its policy 
of strict economy, added moderately to its surplus, registered a significant in- 
crease in membership and expanded the scope of its manifold activities. 
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Program Committee 


Organized around the theme, Applications and Uses of Statistics, the Annual 
Meeting was planned and directed by a Program Committee chaired by Alfred 
Watson with Edward Bloom as secretary. The section on Business and Economic 
Statistics under the chairmanship of Donald Riley, the section on the Training 
of Statisticians under Philip McCarthy, the Biometrics section under Alexander 
Mood, as well as the Committee on Statistics in the Physical Sciences under the 
chairmanship of John Tukey and the Committee on Statistics in the Social 
Sciences under Conrad Taeuber all contributed substantial sections of the Annual 
Meeting Program. 


Bureau of Mines Survey 


The Board is happy to report that the Survey of the Statistical Program of the 
Bureau of Mines under the able direction of J. E. Morton, advised by a commit- 
tee consisting of Raymond Bowman, Ralph Watkins and Clarence Long, was 
completed last summer. The Report has been submitted to the Bureau of Mines 
for its consideration and action. 


Other Committees 


The Association’s advisory committee to the Bureau of Labor Statistics was 
engaged during 1952 in the task of cooperating in the preparation of the new 
consumers’ price index. The committee to advise the Bureau of the Budget on 
statistical policy reports started its work during the calendar year. 


Business and Economic Statistics Section 


This Section drafted its charter last year, which was thereupon submitted to 
and accepted by the Board which modified it slightly and referred it back to the 
Section for its consideration. This Section took the leadership in holding two 
conferences during 1952, one in Illinois and one in Pennsylvania. The Penn- 
sylvania meeting was organized in conjunction with The University of Penn- 
sylvania, Wharton School of Business Administration on the topic, ‘““The Role 
of Statistics in Business Planning and Control.” The Middle Western meeting 
was organized with the University of Illinois around the topic, ‘Minimizing 
Business Risks.’’ All institutional members of the Association and all members of 
the Section were invited. 


Tulsa Chapter 


The Association welcomes its newest chapter, organized in Tulsa, Oklahoma, 
in 1952. The proposed Constitution of the group has already been approved and 
its Charter granted. 


The American Statistician 


After having served over five years as Editor of The American Statistician, 
the Association’s news publication, Sylvia Weyl asked to be relieved from her 
assignment. Mrs. Weyl] was the first editor of The American Statistician and from 
its inception in August of 1947 she made The American Statistician a medium 
for the discussion of non-theoretical matters of concern to the statistical pro- 
fession, including the aims and status of the profession, discussions of statistical 
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institutions and non-theoretical expository articles on current statistical activi- 
ties and projects. Under Mrs. Wey!’s editorship, a substantial number of articles 
dealing with the training of statisticians, the needs and activities of business 
statisticians and descriptions and discussions of practical aids to working statis- 
ticians have appeared. The publication has filled the void between the technical 
professional discussions of statistical theory which appear in our Journal and the 
simple listing of program notes and announcements which used to be carried in 
the Association’s Bulletin. Mrs. Weyl did a remarkable job of producing a read- 
able and useful publication of a sort not previously undertaken by professional 
societies. 

An ad hoc subcommittee of the Committee on Publications was designated to 
search for a new editor. Pursuant to its recommendations, The American Statis- 
tician will be edited under the sponsorship of the Wharton School of Business 
of the University of Pennsylvania. Dr. Raymond Bowman has agreed to give 
the necessary guidance and supervision; Almarin Phillips of his staff will serve 
as Editor. 


JASA 


At the request of W. Allen Wallis, Editor, and at the recommendation of the 
Secretary, the Board approved an expanded budget for the Journal of the 
American Statistical Association during the year jusc ended. Size and content of 
the periodical were expanded, pages printed in 1952 were 20% greater than in 
1951. 


Constitution 


The Committee on the Constitution submitted a revised version to the Board 
of Directors. The proposed revisions include a number of changes of a non- 
controversial character designed to permit greater ease and flexibility of opera- 
tions. After careful consideration, the Board approved the Committee’s pro- 
posals. The Constitution in its revised form is being submitted to the member- 
ship for approval. 

Iiaison 

The Board is desirous of increasing membership participation in the direction 
of the Association’s affairs. With this in view, it was decided to send minutes of 
all its meetings to the chairmen of sections and of standing committees, as well 
as to the members of the Board and Council. In order to ensure clearer present- 
tion of the various opinions and viewpoints expressed at Board meetings, the 
Board at its last meeting decided to appoint a special rapporteur to prepare a 
summary of its views which will supplement the formal minutes. 


Report of the Secretary-Treasurer for 1962 


During 1952, the Secretary’s office continued to devote its efforts to extremely 
careful management. Through rigorous economy the Association completed its 
third successive year in the black. During 1952 the increase in the Association’s 
surplus resulted mainly from strict financial control rather than from member- 
ship gains. 

By the end of 1951, it had become apparent that Association membership had 
remained virtually static for a period of three to four years. Accordingly, the 
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Secretary’s Office undertook a special membership drive last year, circularizing 
by letter a large number of mathematicians, economists, agricultural statisticians 
and others whose professional activities brought them into close relationship 
with the work of the Association. This campaign was interrupted last summer, 
but will be resumed in 1953. It is encouraging to note that the Association has, 
for the first time since 1948, shown a significant growth in membership. 

The number of members at the end of 1951 was 4,356. During 1952 about 350 
members will be dropped because of resignation, death, or non-payment of dues. 
New members in 1952 totalled 666. Thus the Association starts the year 1953 
with 4,655 members. The net membership growth in 1952 was 300 members— 
the first net increase in 5 years. 

Expenses for 1952 were budgeted at $48,666. The actual expense for the year 
was $47,529.49—a drop of over $1,000. Although the increase in the cost of 
publications was about $300 over the 1952 budget, certain other expenses were 
kept under budget level by saving wherever possible. 

While the saving in any particular item was not large in itself, the combined 
total was just about sufficient to offset the rise in the cost of publications. The 
actual income for 1952 was $54,061. This is about $700 under the budget figure. 
The loss is primarily due to the decrease in the sales of back issues of the Journal. 
The actual net income for 1952 is $6,531 which provides for a substantial increase 
to the Association’s surplus. Thus the Association will start the year 1953 with 
$16,965 in surplus funds. 


Office Organization 


As a result of the resignation of Sylvia Weyl, who had been Executive Assistant 
to the Secretary since the beginning of 1944, it was necessary to reorganize the 
Secretary’s office. With the agreement of the liaison committee of the Board of 
Directors, Edgar Bisgyer was appointed to serve in a managerial capacity in the 
Secretary’s office. Mrs. Weyl, who in addition to her administrative work had 
been editor of The American Statistician since it was founded in 1947, agreed to 
continue to edit that publication until the Board of Directors and the Secretary 
were able to find a suitable person to assume the editorship. The transfer of edi- 
torial function from the Washington office to the University of Pennsylvania 
campus will thus be carried out in an orderly fashion and without loss of ef- 
ficiency. 


Financial Recommendations 


The Treasurer’s Report, which is printed separately, shows the strengthened 
financial position of the Association and emphasizes that 1952 was the third suc- 
cessive year of accruing surplus. In this connection, the Secretary has recom- 
mended to the Board of Directors that the Association plan to build upa surplus 
equivalent to one full year’s income before entering into any major expansion 
of its activities. It is proposed that approximately $3,000 be added to surplus 
each year until this objective is attained. With this in mind the budget for 1953 
has been calculated to provide a net income of slightly over $3,000. The proposed 
income is budgeted at $50,650, while expense has been calculated at $47,612. 
While expense is calculated as closely as possible, income is figured very con- 
servatively and actual income figure for 1953 may be expected to be somewhat 
larger than that listed in the budget. 
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To the Board of Directors of 
American Statistical Association. 


I have examined the attached financial statements of American Statistical 
Association relating to the year ended December 31, 1952. My examination 
was made in accordance with generally accepted auditing standards and, ac- 
cordingly, included such tests of the accounting records and such other auditing 
procedures as were considered necessary in the circumstances. 

The recorded cash receipts for the year were traced to the deposits shown on 
the bank statements and the amounts for dues and subscriptions were tested 
with the membership and subscription records. The paid checks were inspected 
and related vouchers tested in support of cash disbursements for the year. The 
bank balances were reconciled with amounts reported directly to me by the 
depositories and the cash on hand at December 31, 1952, was verified by inspec- 
tion. I did not check the membership and subscription records in detail or make 
any independent verification of the inventory of old Journals, the office records 
of which are based, in part, on data assembled in prior years. 

in accordance with a resolution of the Board of Directors, the expense incurred 
in publishing a directory, distributed to the membership in 1951, is being spread 
over a three-year period although such costs would appear to be applicable pri- 
marily to the year 1951. The amount deferred at December 31, 1952, aggregated 
$831.86. 

In my opinion, the accompanying statements present fairly the position of 
American Statistical Association at December 31, 1952, and the results of its 
operations for the year, in conformity with generally accepted accounting prin- 
ciples applied on a basis consistent, except as mentioned in the preceding para- 
graph, with that of the preceding year. 

James G. JESTER 
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AMERICAN STATISTICAL ASSOCIATION 


BALANCE SHEET 


Assets 


Cash in banks and on hand 

Accounts receivable 

Investment in United States Savings Bonds, 
Series G, due 1962, at cost 

Inventory of old Journals, at approximate cost 

Inventory of monograph on Acceptance Sampling, 


Inventory of emblems, at cost 
Furniture and fixtures, at cost less depreciation... . 
Deferred charges: 


Accounts payable 
Deferred income (collections applicable to subse- 
quent year): 


Net Worth: 
Life membership reserve 
Surplus, per statement 


December 31, 


1952 


$39 ,503. 
1,099. 


3,100 


1,909. 


233 
463 


2,192. 


831 
1 ,007 


14 
58 


-00 


85 


-93 
-50 


42 


.86 
.68 


1951 


$28 ,618 .28 
1,385.48 


3,100.00 
1,625.85 


452.54 
588 .00 
2,026 .99 


2,131.86 
723 .48 





$50 ,342. 


15 


$40 ,652 .48 








$ 5,930. 


$16,179. 
5,524. 
2,865. 


$ 4,811.46 


$13 ,618 .00 
4,379.60 
4,365 .67 





$24 ,568. 


$22 ,363 .27 





$ 2,877. 
16,965. 


$ 3,044.10 
10 , 433 .65 





$19,843. 


$13,477.75 





$50 ,342. 


$40 ,652 .48 
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AMERICAN STATISTICAL ASSOCIATION 
STATEMENT OF INCOME AND SuRPLUS ACCOUNTS 


Year ended December 31, 
1952 1951 
Income: 


Dues—current year $37,101.00 $33,358.00 
Dues—prior year 184.00 400 .00 
Life membership income...................... 166.31 84.40 
Subscriptions 9,971.35 9,433.15 
Advertising 1,594.47 2,251.13 
Journal sales, less cos’ of sales 1,079.14 1 ,907 .03 
American Statistician ezles 82.65 158.65 
Acceptance Sampling monograph, less cost of 
24.39 210.60 
Mailing list income 704.17 683 .34 
Biometrics sales 453 .25 490 .80 
Interest income 541 .37 395 .40 
Reimbursement of overhead expenses: Bureau 
of Mines project 2,000 .00 1,500.00 
Miscellaneous 159.08 702 .63 





Total income $54,061.18 $51,575. 





Expense: 


Journal—printing, mailing and reprints $12,858. $ 9,997. 
Salaries and wages 14,716. 14,817. 
American Statistician 5,810. 4,815. 
Membership Directory—apportioned expense... . 1,300. 1,233. 
Annual meeting expense 550. 141. 

2,400. 2,400. 

2,434. 1,659. 

1,163. 1,911. 
Promotion expense 702. 2,375. 
Telephone and telegraph 563. 482. 
Travel and committee expense 2,001. 1,196. 
Depreciation of furniture and equipment 555. 510. 
Accounting services 970. 970. 
Miscellaneous 1,501. 1,947. 





Total expense $47 ,529. $44,458. 





Excess of income over expense for the year $ 6,531. $ 7,116.22 
Add: Surplus account at beginning of year........ 10,433. 3,317.43 





Surplus account at end of year $16 ,965. $10 ,433 .65 





BOOK REVIEWS 


Wesley Clair Mitchell: The Economic Scientist. Arthur F. Burns, editor. New 
York: National Bureau of Economic Research, Inc., 1952. Pp. viii, 327. $4.00. 
See the article by Adolf A. Berle, pp. 169-175 in this issue. 


An Introduction to Scientific Research. EF. Bright Wilson (Theodore William 
Richards Professor of Chemistry, Harvard University). New York: McGraw- 
Hill Book Company, 1952. Pp. v, 365. $6.50. 


E. L. Lenmann, University of California (Berkeley) 


HIs volume has as its primary concern the experimental and observational 

aspects of science, and its main purpose is to provide a collection of gen- 
eral principles and specific methods applicable to the planning and analyzing 
of scientific work of this kind. Under these circumstances it seems quite 
natural (at least to a statistician) that the concept of randomness pervades 
the book, and that of the eight principal chapters, five are of an essentially 
statistical character. Taken together, these form an introduction to the con- 
cepts and methods of modern statistics. 

Any book covering such wide territory faces a twofold danger. It may get 
lost in generalities which are of no real use to the practicing scientist. Alter- 
natively, it may become a collection of rules and recipes, which are not only 
tedious but also likely to be misused if they do not grow out of an under- 
standing of the basic concepts and are not set within a framework of theory 
that indicates the necessary limitations. The author manages to avoid this 
double pitfall by putting considerable emphasis on the conceptual side of the 
subject, and by developing the techniques as examples of the general ap- 
proach rather than in the dogmatic way one so frequently encounters. 

In setting forth the most important statistical notions, such as those of 
hypothesis testing, confidence intervals, factorial experimentation, and ran- 
domization, the author uses the device of first introducing a concept only 
qualitatively in a natural experimental setting. An analytical development 
is usually deferred until a later chapter and frequently the treatment of more 
specific problems at a still later point provides an opportunity for reiteration 
of the fundamental principles and further clarification of the conceptual dif- 
ficulties. This gradual approach saves a reader unfamiliar with the subject 
from being overwhelmed by so many new ideas. 

The development of the main statistical theories is carried out in Chapters 
4 (design of experiments, 33 pages), 8 (analysis of experimental data, 63 
pages) and 9, (errors of measurement, 45 pages), where numerous specific 
techniques are also given, such as testing and estimation in binomial, normal 
and Poisson distributions, testing for goodness of fit, analysis of variance, 
control charts. Many further methods are mentioned only briefly but always 
concretely, as for example run tests, sequential probability ratio tests, and 
tolerance limits. 
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Closely related to the above material is Chapter 7 (classification, sampling, 
and measurement, 16 pages) in which the concept of a sample is clarified, and 
scientific induction is discussed as the method of inference from a sample to 
the population from which it was drawn. A formal discussion of probability 
is postponed until Chapter 10 (probability, randomness, and logic, 22 pages), 
where it is defined in terms of a random sampling process and where the 
addition and multiplication rules are derived. The author here also discusses 
the problem of how te utilize prior information, and in this connection gives 
a brief account of the subjective point of view as presented by Jeffreys. 

As the author states, the book is intended primarily for “students begin- 
ning research and for those more experienced research workers who wish an 
introduction to various topics which were not included in their training.” 
While a number of derivations are sketched that require differential argu- 
ments, such as those of the ¢- and x?-distributions, these are usually given at 
the end of a chapter, so as not to interrupt the main development. Conse- 
quently, the book has much to offer even to a reader not familiar with the 
calculus. Of great help are the many examples, some particularly interesting 
ones being taken from the physical sciences, and the detailed discussion of 
the many pitfalls that threaten the unwary. 

If fault must be found, the reviewer would have preferred a clearly stated 
general definition of mathematical expectation and variance to the one given 
here by implication. He also missed a statement of the additivity of expecta- 
tions and other results of this type, which one feels might have been included 
in chapter ten. These criticisms, however, view the book as a text in statistics, 
which it is not meant to be. Also, they are concerned solely with the mathe- 
matical aspects of the theory. With regard to the meaning, intent and possi- 
bilities of statistical methods, the reviewer knows of no clearer or, on its 
level, more useful account. 

Finally, it should be mentioned that in addition to the five chapters re- 
viewed here the book contains five chapters of a general character (Chapter 
1, choice and statement of a research problem, 9 pages; Chapter 2, searching 
the literature, 11 pages; Chapter 3, elementary scientific method, 15 pages; 
Chapter 6, execution of experiments, 24 pages; Chapter 13, reporting the 
results, 10 pages); and three important technical chapters (Chapter 5, de- 
sign of apparatus, 58 pages; Chapter 11, mathematical work, 29 pages; 
Chapter 12, numerical computations, 22 pages). 


Statistical Theory in Research. R. L. Anderson and T. A. Bancroft. New York: 
McGraw-Hill Book Company, 1952. Pp. xix, 399. $7.00. 


CuurcHILL E1sennant, National Bureau of Standards 


His volume treats in detail an amazingly large number of topics in statis- 
tical theory and methodology, with careful attention to the assumptions 
and mathematics upon which they are based, and illustrates their applica- 











360 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1953 


tion through numerous worked examples in the text, plus a large number 
of well chosen exercises at the end of each chapter. Each chapter has its own 
list of “References cited,” to which explicit reference is made at appropriate 
points in the text. These are well chosen, as are the supplemental items listed 
as “other reading.” Together they will enable the reader (or teacher) to gain 
(give) fuller information on topics treated incompletely, or only mentioned, 
in the text. 

Written to fill a need expressed by “many research workers” for “a conven- 
ient reference book on statistical theory pointed to research problems, which 
could be used in conjunction with their books on general statistical methods, 
experimental design, and survey sampling,” this book should be well received 
by students and research workers in the fields of agricultural, biological, and 
sociological research who (1) have a “good background in differential and in- 
tegral calculus” and (2) have had some first-hand experience with conducting 
and interpreting experiments in one or more of these fields. For these, it will 
serve to explain the mathematical bases of the various statistical techniques 
with which they have become acquainted through their experimental work, 
or from elementary “practical” courses in statistical method, or both. Stu- 
dents and research workers in the physical sciences and engineering, on the 
other hand, will find the material covered herein somewhat less familiar, and 
its scope somewhat less satisfactory—some statistical principles and tech- 
niques of importance in these fields are only touched upon briefly here, and 
others are not even mentioned. 

Viewed as a textbook, this volume is really two books in one. Part I, en- 
titled “Basic Statistical Theory,” and the first four chapters (Chaps. 13-16) 
on regression analysis in Part II, are suitable for a one-year course in mathe- 
matical statistics taught, for example, in a department of mathematics or 
statistics, and can be understood by a student with the mathematical ma- 
turity of a mathematics major even though he may have had no first-hand 
experience with experimentation. These same chapters will be more difficult 
for the experimental scientist; here his practical intuition born of experience 
will serve him less well. 

Part II is entitled “Analysis of Experimental Models by Least Squares.” 
Experimental scientists in agriculture, biology, and social science who have 
had some experience with analysis of variance and modern experimental 
arrangements will be much more at home here in spite of the fact that the 
complexity of the algebraic manipulations involved will give them plenty 
of opportunity to exercise their mathematical skills. Here their intuition de- 
rived from experience will serve them well, and I believe that they will fully 
appreciate the import of the mathematical details even though they may 
be unable to reproduce them without aid from the book itself. On the other 
hand, the student who approaches the subject from the mathematical view- 
point cannot in my opinion fully appreciate the material presented in Part 
II until he has gained some practical experimental experience, no matter 
how adept he becomes at the mathematics. 
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The physical scientist and engineer without previous training or experi- 
ence in analysis of variance and the Fisherian principles and techniques of 
experiment design will be in much the same boat—they will have the mathe- 
matics but not the viewpoint needed to fully appreciate these chapters. Be- 
fore tackling Part II, they will do well to gain the requisite perspective by 
reading, for example, Chapters 4 (design of experiments) and 8 (analysis of 
experimental data) in An Introduction to Scientific Research, by E. Bright 
Wilson, Jr. (see preceding review). 

In the preface, the authors “welcome suggestions on methods of improv- 
ing this book both from a reference and from a text standpoint, without 
adding materially to the complexity of the material or the length of the 
book.” My own personal feeling is that a revision of this volume should not 
be attempted at all. Instead, Part II, of which Bancroft contributed the 
first four chapters and Anderson the remaining nine, should be split off from 
Part I, and issued separately as a monograph on “Analysis of Experiments by 
Least Squares, and the Method of Variance Components.” Part II is a very 
compact and lucid treatment of the mathematics underlying these distinctly 
different aspects of “analysis of variance and covariance;” is filled with up- 
to-the-minute information on these topics; and contains in the last chapter 
a useful “summary of needed research.” This material forms a very useful 
supplement to technique books on analysis of variance and the design of ex- 
periments, and in the form of a separate volume could and should be kept 
up-to-date as such. 

Except for isolated sections, Part I is essentially a compact treatment of 
statistical theory appropriate to measurement data (in contrast to enumera- 
tion data, and rank-order data). It might well be expanded a bit to make it a 
fairly comprehensive monograph on this phase of statistics. As it stands, it is 
a very readable and yet compact coverage of the elementary theory of prob- 
ability, univariate and multivariate distributions, mathematical expectations 
and moments (including moment-generating functions and cumulants), trans- 
formation of variables and derivation of sampling distributions of statistics, 
and orthogonal linear functions; together with point estimation (from the 
Fisherian viewpoint), interval estimation and tests of statistical hypotheses 
(from the Neyman and Pearson viewpoint). To round it out it needs a dis- 
cussion of the concept and techniques of “statistical control” in relation to 
measurement processes, a compact discussion of the various different 
“straight-line situations” of importance to the measurement analyst, some- 
thing on “the law of propagation of error”, and a bit on sequential analysis. 

It is my opinion that what is needed today is not more books that at- 
tempt to cover a wide (and for the most part the same) selections of ma- 


‘terial in detail, but rather, some books that provide a panoramic view of a 


large fraction of existing statistical theory and methodology, and others that 
are comprehensive monographs on some single phase in which the author is 
particularly well grounded. 
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Statistics for Sociologists. Revised Edition. Margaret Jarman Hagood and Dan- 
iel O. Price. New York: Henry Holt and Company, 1952. Pp. xii, 575. $5.75. 


A. W. MarsHatu, The Rand Corporation 


HIS is a revised version of a book published in 1941 by the senior author. 

The revisions and additions include (1) new chapters on sampling in social 
surveys and on applications of factor or component analysis in sociological 
research, (2) the introduction of the work of Guttman into the chapter on 
indexes and scales, (3) new, up to date reading references, (4) substitution of 
current illustrative materials in almost all examples, (5) reduction of space 
devoted to computational procedures, (6) systematic collection of the most 
useful formulae hitherto scattered throughout the text at the end of the text 
and, (7) deletion of Part V of the earlier work, which contained material on 
birth and death rates and the construction of life tables. All of these changes 
have undoubtedly added to the general usefulness of the book. 

This book represents an attempt to write an essentially non-mathematical 
text, for a group of students who would not understand a mathematical one, 
in order to convey to them “all of the basic statistical methods which have 
been used in sociological research, and some of the newer ones which have 
not as yet found wide application in sociological fields.” This same aspiration 
guided the preparation of the first edition 11 or 12 years ago. As then, books 
of this type for this particular audience raise very serious pedagogical prob- 
lems. This book probably succeeds as well as any book could in making some 
progress toward the above goals and so long as there exists a demand for 
texts which make such an attempt it should prove useful. 

The reviewer’s private fantasy life on how to teach statistics to novices, in 
some substantive field such as sociology, contains two elements: 

(1) A good supply of first rate papers by the leaders in the substantive 
field of study which contain well thought out applications of statistics to im- 
portant problems. 

(2) A textbook of the general type being reviewed here; i.e., one that is 
essentially an introduction to statistical tools of the trade, but with much 
more emphasis on first principles and the appropriate cautions to be ob- 
served in the use of the tools. In particular this model text would stress the 
logic of model construction and analysis of the structure of the population 
from which samples are assumed to be drawn. In addition it would have di- 
rections for the discovery of cases where sociologists or others ought to yell 
for help from mathematical statisticians, with perhaps some indications as 
to what kinds of help are probably forthcoming. (The importance of this lat- 
ter problem is very much overlooked. In the course of the reviewer’s collabo- 
ration with sociologists, when the problem they wished to deal with was com- 
pletely stated, the statistical problems involved were very frequently not 
textbook cases.) 

This model text would be slanted toward the needs of those students with 
serious research ambitions and to that extent is not directly comparable with 
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the present work and no unfair comparisons should be indulged in. The de- 
mand and audience for such a model text (if indeed the above model text is 
truly optimal) has not really developed in sociology, nor is it possible to 
satisfy condition (1) above. The authors themselves “regret that the progress 
in the application of statistics to sociological research since 1941 has not 
provided a basis for a more complete substitution of new titles” for older ref- 
erences for supplementary reading. 

Given their audience, the authors have written a book which is, if any- 
thing, a little above the average abilities of those who will read it. As the 
paragraphs immediately above suggest this is bias in the right direction. The 
book is very sound on those problems it does treat and with close reading 
no one should be given any wrong ideas about the application of statistics to 
sociological problems. 


Statistical Quality Control. Second Edition. Eugene L. Grant. New York: Mc- 
Graw-Hill Book Company, 1952. Pp. xvi, 557. $6.50. 


C. C. Crare, University of Michigan 


INCE two very good reviews of the first edition of this widely used textbook 
S appeared in this Journal, Vol. 42 (1947), pp. 180-184, it is appropriate 
first to note the more important changes in the new edition. 

The most extensive additions are in the area of acceptance sampling. Since 
the appearance of the first edition, the Military Standard 105A acceptance 
sampling tables for inspection by attributes have superseded the Army Serv- 
ice Forces tables. Professor Grant has drawn on the fine study he and Lorber 
made of the new tables to replace the section on the older tables with a very 
good and thorough discussion of the new ones. The clear explanations of the 
use of the tables, of their relation to their predecessors, and of their more 
important characteristics should be welcomed by many present and poten- 
tial users. Another new feature of importance is the introduction of accept- 
ance sampling by variables. The use of known-sigma plans for one- and two- 
sided specifications and of unknown-sigma plans for one-sided specifications 
is explained and relevant tables are provided. The U and Q tests of Schwartz 
and Kaufman are also included as is Shainin’s Lot Plot plan, the latter in a 
rather brief and non-committal fashion. The reviewer was disappointed in 
finding no critical discussion of Shainin’s widely used plan. Though group 
sequential plans come in for repeated mention and are explained, the reviewer 
was also disappointed with the very brief treatment given item-by-item se- 
quential plans. An example is given of the calculation of an attributes plan 
with the very minimum of discussion. Surely the importance of sequential 
analysis for applications merits more attention. 

There is a good deal of revision in other sections of the book, too. The 
section on the cost aspects of quality decisions has been somewhat expanded. 
Worthy of particular mention are: (1) the use of the exact value instead of the 
large sample approximation to the standard deviation of the sample stand- 
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ard deviation for small samples from a normal universe; (2) the addition of 
a brief discussion of the use of the normal frequency function as an approxi- 
mation to the binomial; (3) a modification in the handling of varying sample 
units on c charts; and (4) a discussion of the application of control charts to 
clerical work. These are all improvements. But of greater importance is the 
fact that the new edition contains 302 problems as compared with 145 in the 
first edition. Perhaps the most difficult task in writing a book of this kind is 
to provide enough good problems; Grant deserves special commendation on 
this point. 

There are also points regarding which the reviewer feels mildly critical. 
First, he does not agree that one loses any significant figures when he sub- 
tracts the square of the mean from the mean of squares in calculating a 
variance. The text appears to give the impression that one sacrifices some- 
thing in return for the convenience of the “short-cut” method. Second, the 
reviewer would have liked to see considerable emphasis on the fact that set- 
ting limits on an X chart by use of an R calculated from R’s whose chart 
shows lack of control is at best only tentative. Third, because it is so hard to 
make the point really stick, he would have been pleased by some real ham- 
mering at the fact that deleting points out of limit lines to set revised limits 
is only paper work unless the related assignable causes are actually removed. 
Fourth, on page 98 the idea of a confidence interval is introduced, though 
not so designated, at first correctly, and then marred by “a less precise but 
more common interpretation” which incorporates the common first miscon- 
ception of this notion. In fact the reviewer believes that no book on statistical 
methods, even at the most elementary level, should fail to give considerable 
attention to this important and clarifying concept. Finally, on page 412 it 
seems that some attempt should have been made to explain the difference 
in origin between the sample standard deviation, as used elsewhere in the 
book, and the square root of the unbiased estimate of the variance and it 
probably should be stated that the latter is also a biased estimate of the 
standard deviation of a normal universe. 

It should be remembered that this is a pre-calculus text and written for the 
student either in school or in a shop who has the very practical aim of mak- 
ing statistical quality control methods work. The reviewer has had a good 
deal of experience with the first edition and he is convinced that the second 
edition only keeps this book considerably the best in its field. As a text for 
engineering students who have had a year of the calculus one could wish that 
there were more relevant mathematical material and that, say, the first 
three chapters on X and R charts had been more concisely written. But the 
engineering point of view in this book is authentic and Grant keeps his feet 
firmly on the ground. This, in combination with its clear and systematic 
exposition, its wealth of examples, and its numerous problems makes its 
merits far outweigh the few imperfections noted. 
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Statistical Theory with Engineering Applications. A. Hald. New York: John 
Wiley and Sons, Inc., 1952. Pp. xii, 783. $9.00. 


G. J. LieserMan, Stanford University 


CCORDING to the author, the aim of this book is “first and foremost to 
furnish the reader with simple and practical methods—which can be 
understood and applied by non-statisticians—for the handling of the ma- 
jority of the problems which occur in everyday work.” Hald has not only 
accomplished this, but, in addition, has provided the statistician with a ready 
reference book containing an abundance of interesting examples drawn from 
the engineering fields. Furthermore, this book will be useful as a supple- 
mentary text in many of the introductory courses in mathematical statistics, 
inasmuch as the treatment of the topics assumes a minimum knowledge in 
mathematics of the calculus. 

The book devotes chapters to the basic statistical topics including the 
calculus of probabilities, distribution theory, limit theorems, analysis of 
variance, design of experiments, regression, and correlation. Hald’s treat- 
ment of most of these topics is excellent. For example, for an elementary text, 
his axiomatic approach to probability theory is quite refreshing. His han- 
dling of degrees of freedom as related to linear restraints on the random 
variables is much more mature than that found in the usual texts. 

For the practical man, the oustanding achievement of the book is the very 
complete treatment of the usual standard techniques in the handling of data. 
For example, the chapter on Regression contains a section on solving the 
Normal Equations by the Doolittle method. Part of a chapter is devoted to 
criteria for rejection of outlying observations. Almost any topic from a treat- 
ment of the Central Limit Theorem to a section on Punched Cards can be 
found in the book, together with illustrative examples. From a practical 
point of view, Statistical Theory is far superior to anything that has been 
written to date. 

The only criticism of the book is that it is somewhat incomplete from a 
theoretical point of view. Although Hald states that “a general exposition of 
the theory of statistics lies outside the scope of the present publication,” the 
reviewer feels that because the author presents a great deal of theoretical 
as well as applied statistics, certain topics should have been included. There 
is nothing said about either moment generating functions or characteristic 
functions. The sections devoted to statistical inference are very weak. The 
likelihood ratio test is never mentioned. Maximum likelihood is inadequately 
discussed as is the whole Neyman-Pearson theory. Perhaps this criticism is 
due largely to the fact that Hald succeeds in accomplishing more than he 
originally set out to do, thereby suggesting that he could easily have accom- 
plished still more. 

An interesting and most welcome feature found at the conclusion of many 
of the chapters is a section devoted to “Notes and References.” In this sec- 
tion, Hald describes the history of the contents of the chapter, and acknowl- 
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edges the authorship of many of the proofs of the theorems presented. For 
example, at the end of the chapter entitled “Fundamental Calculus of Prob- 
abilities,” Hald writes: 


The calculus of probabilities was developed during the seventeenth cen- 
tury in connection with the solution of problems of games of chance. The 
classical definition of probability given in §1.4 took form in a correspondence 
between Pascat and Frermart in 1654. On basis of this definition an e:xten- 
sive mathematical theory was developed. Important contributions were 
made by J. BERNouLLI in Ars Conjectandt, 1713, and P. S. pz Lapuacz, 
who in Théorie analytique des probabilités, 1812, gave a very extensive and 
systematic exposition of the results and methods of the calculus of proba- 
bilities. ... 

The construction of a calculus of probabilities from a set of axioms, as 
done in §1.3 is analogous to the way in which other branches of applied 
mathematics have been built up. The first systematic exposition of this 
method and its consequences was given by A. Kotmocororr: Foundations 
of the Theory of Probability, Chelsea Publishing Company, New York, 1950, 
originally published in German in 1933. An elementary exposition of the 
modern mathematical concept of probability may be found in P. R. Hat- 
MOS: The Foundations of Probability, Amer. Math. Monthly, 51, 1944, 493- 
GAB. «i: 

An elementary exposition of the different concepts of probability from a 
philosophical point of view can be found in E. NaGgE.: Principles of the The- 
ory of Probability, International Encyclopedia of Unified Science, Vol. 1, 
No. 6, Chicago, 1939. 


Published in conjunction with this book is a separate volume of statistical 
tables and formulas. This supplement is one of the most extensive books of 
tables published to date. Included in this volume are tables of the normal 
distribution, the ¢-distribution, the x?-distribution, the F-distribution, the 
distribution of the range, probits, confidence limits for the parameter of the 
binomial distribution, random numbers, ete. 

Not only is Statistical Theory a worthwhile addition to the family of books 
in statistics, but it is unique among books on statistics in that a quotation 
from George Bernard Shaw is used to illustrate a point. Hald states: 


Thus the concepts of stochastic and causal dependence must be carefully 
differentiated. Bernard Shaw in his characteristic manner illuminates this 
point in the following quotation from the section “Statistical Illusions” of 
the preface to The Doctor’s Dilemma: “Thus it is easy to prove that the 
wearing of tall hats and the carrying of umbrellas enlarges the chest, pro- 
longs life, and confers comparative immunity from disease; for the statistics 
shew that the classes which use these articles are bigger, healthier, and live 
longer than the class which never dreams of possessing such things. It does 
not take much perspicacity to see that what really makes this difference is 
not the tall hat and the umbrella, but the wealth and nourishment of which 
they are evidence, and that a gold watch or membership of a club in Pall 
Mall might be proved in the same way to have the like sovereign virtues.” 
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Modern Elementary Statistics. John E. Freund. New York: Prentice-Hall, Inc., 
1952. Pp. x, 418. $5.50. 


Norman Rupy, Sacramento State College 


TTEMPTs to shift the emphasis in introductory statistics courses from de- 
A scriptive to inductive statistics have suffered from the absence of suita- 
ble textbooks and from the poor preparation in mathematics of most college 
students. The book reviewed here frankly emphasizes inductive statistics 
and, according to the preface and the publisher’s advertisements, requires 
only a minimum amount of mathematics. On this basis alone, the book de- 
serves the close attention of those concerned with the introductory statistics 
course. 

The author states that “The order and the emphasis of the material cov- 
ered follows the modern trend in the teaching of statistics—to include in- 
formally topics that in the past have often been taught only on an advanced 
level.” At other points, he emphasizes that the meanings of statistical ideas 
are of greater importance than the formulas. 

Despite certain criticisms to be made later, the reviewer’s over-all impres- 
sion is that the book will perform a valuable service in a general introductory 
course. The concept of a sampling distribution is very well developed and is 
emphasized whenever a statistic is introduced. Undoubtedly, the student 
who reads this book carefully will absorb what many believe to be the single 
most important idea in a beginning course, namely that of variability in the 
possible outcomes of a sample. The discussion of confidence intervals and 
their interpretation is also very good, and is strengthened by the construc- 
tion of intervals for a non-normal distribution. The informal manner of 
presentation employed and the general tone of the book are consistent with 
what appears to be a current trend toward the humanizing of statistical 
ideas, comparable to the current humanism in the natural sciences. 

The adoption of an informal manner of presentation, which is virtually a 
necessity in the introductory course, places upon the author the responsibil- 
ity of providing enough exposition to insure that the meanings are clear. One 
of the weaknesses of the book lies in the fact that too often the meaning is 
subordinate to the formula, too often the elements of the formula are not 
justified heuristically or intuitively. A few examples will illustrate this criti- 
cism. 

In the section entitled “Some Rules of Probability,” the notions of con- 
ditional and joint probability are presented without benefit of any graphical 
or tabular aids, such as the two-way frequency table, found in the books by 
Wilks and Duncan. Again, in the discussion of confidence intervals for pop- 
ulation means, using small samples, the factor ./n—1 appears without any 
mention of degrees of freedom. In fact, the concept of degrees of freedom is 
not mentioned at any of the three places at which reference is made to the 
“¢” distribution. In yet another instance, the x? distribution is discussed in 
three different applications: first, as employed in the construction of confi- 
dence intervals for the population standard deviation; second, as a test for 
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association in contingency tables; and, third, as a test of goodness-of-fit. In 
the first application, the notion of degrees of freedom is not mentioned, while 
in the other two it is. There is no connection made between the two formulas 
given for x?, and no heuristic justification given either for squaring the dif- 
ference between observed and expected frequencies or for dividing the square 
by the expected frequency. 

The other general criticism which can be made of the book concerns the 
omission of some ideas of modern statistics which are important and basic 
enough to be considered in an introductory course. In the discussion of hy- 
pothesis testing the reader is advised to avoid the type II error (accepting 
the null hypothesis when it is false) by never accepting the null hypothesis 
at all. By this approach, the author precludes any possible discussion of the 
theory of statistical decision,’ certainly as important a topic in the intro- 
ductory course as the use of runs in testing for randomness. (The latter gets 
eleven pages of text and two pages of tables.) The omission is especially dis- 
turbing because in each of the two examples used, the decision to reserve 
judgment on the null hypothesis is also a decision whose consequences should 
be considered. With the type II error eliminated from consideration, the 
selection of the level of significance (only two-sided tests are considered) is 
made by following “the customary rule” of .05. This advice is not at all dif- 
ferent in spirit from the 3 rule found in older textbooks. 

Another major omission, in the reviewer’s judgment, is the complete ab- 
sence from the sections on correlation and regression of the concept of a 
“model” or of any indication of the importance of a priori considerations in 
scientific investigations. According to the text, the treatment accorded the 
data (linear or curvilinear fit) is determined by inspection of the data itself, 
and since curvilinear fitting is taken as being outside the scope of the book, 
there is no need to discuss criteria for choosing between linear and curvilinear 
regressions. In the discussion of the standard error of estimate (in which 
no mention is made of degrees of freedom), it is incidentally revealed that X 
and Y are assumed to be normally distributed [a bivariate normal distribu- 
tion?]. As usual, no mention is made of the fact that the prediction of Y on 
the basis of a given X is actually that of the average Y corresponding to the 
given value of X. 

A final comment, lest these criticisms lead the reader to overlook or dis- 
count the comments made in the third paragraph: The reviewer has adopted 
the book for a general introductory course. 


Elementary Statistics. Revised. Morris Myers Blair (Professor of Economics, 
University of Tulsa). New York: Henry Holt & Co., 1952. Pp. xiv, 735, $5.50. 


R. Cuay Sprowts, University of California (Los Angeles) 


HILE reading the first chapter of this book, my pulse quickened in an- 
ticipation of a “new” book in the field of elementary statistics. The 
author states: “Because life is so short that we cannot know all about any- 
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thing, we have to make our decisions on the basis of sample information. 
This is the statistical method;” and, “Statistics as a science in its broader 
meanings is not only a means of guiding our judgment in making the daily 
decisions of life, but also a device or tool for discovering new truths. It is one 
of the most powerful engines for research.” From here to the end of the book 
my pulse was normal except for one small flutter on page 208 where the 
author writes: “But analysis is usually not an end in itself. One analyzes in 
order that he may comprehend and forecast, infer, project, and apply his 
information for further practical or theoretical use. The great end of sta- 
tistical description and analysis is inference.” If the author really believes 
this, why did he not orient his textbook in this direction? 

Is it because, as he says, “He is convinced that statistics should first be 
taught to most students from the standpoint of preparing them to be con- 
sumers of statistics instead of creators of research?” What better way to 
teach students to be intelligent consumers (even of descriptive statistics) 
than to make them create one small piece of research in which they them- 
selves analyze a problem, specify the data needed, collect that data, make 
the necessary estimates, compute their errors, and then draw an inference? 

Or is it because he thinks of a very selective area of consumption? “Many 
farmers and small merchants read carefully the price quotations, deliveries 
of grains and livestock ... tables and charts... in the daily newspapers. 
These people are consuming statistics.” I infer from this that he wishes to 
teach people how to interpret statistical data—an admirable wish. These 
people and others are also continually faced with making decisions which 
are based upon incomplete information and, therefore, subject to uncertain 
outcome. Implicitly or explicitly, they deal in probabilities. What better 
place than in elementary statistics to contribute to the general education by 
introducing the statistical method with its emphasis upon choice among al- 
ternatives and the probability of being wrong in that choice? 

Granting that Blair is correct in appraising the consumer area, I believe 
that there are serious omissions in the text. I shall cite a few examples. In his 
discussion of the range (p. 152) he does not mention the interpretative falla- 
cies inherent in the dependence of the magnitude of the range upon the 
sample size. In the discussion of regression analysis there is no cautioning 
about the value of the intercept often being an extrapolated value subject to 
large error and often an impossible value in the particular problem. Spurious 
correlation, a serious pitfall for the consumer, is not mentioned; neither is the 
frequently recurring trap of the regression fallacy. On p. 236 it is stated that 
rank correlation methods are not widely used. This Journal recently featured 
an article on the use of ranks which listed sixty-nine works in its bibliogra- 
phy; fifteen of these had the word rank in their title and others, I am sure, 
also discussed ranking methods. In the discussion of stratified sampling (p. 
370), there is nothing to indicate that the real gains from stratification occur 
when the conditions are such that individual strata are sampled dispropor- 
tionately to their size in the population. In the chapter on seasonal variation 
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(chapter 16) he gives three methods of calculating seasonal variation, but 
nowhere do I find (a) an explanation of a seasonal index of, say, 120 for a 
particular month; (b) any mention of adjusting data for seasonal variation; 
‘(c) an interpretation of a seasonally adjusted value. 

It seems to me that Blair has not only missed a consumer area to which 
statistics has much to contribute, but that he has also failed to write a book 
which is successfully consistent with his own original intent, both because of 
these omissions which I have mentioned and because he continually drags 
the student through a mass of computations (a function of the producer?) to 
the neglect of interpretation. In my judgment, this is just another textbook 
in elementary statistics. It may be preferred to another because the first 
edition (1944) has been used in the past, because there are elaborate work- 
sheets which aid students (a valuable service), or because it may be a con- 
venient handbook. In other respects it is not distinguishable from a dozen 
or so other elementary textbooks in statistics. 


Statistical Methods for Social Workers. Wayne McMillen. Chicago: The Uni- 
versity of Chicago Press, 1952. Pp. xi, 564. $6.75. 


DaniEL O. Price, University of North Carolina 


ype stated aim of this book is to provide an orderly introduction to de- 
scriptive statistics for social workers (vii). In the opinion of this reviewer 
it misses the mark considerably. It is uneven in its treatment of material and 
contains several serious statistical flaws. It deals almost entirely with the 
field of descriptive statistics and contains the statement, “Perhaps in the 
long run inductive statistics will make contributions to human knowledge 
that clearly entitle it to be regarded as of far greater value to society than 
descriptive statistics” (vii, reviewer’s italics). The first five chapters deal 
with the collection, editing, tabulation, and presentation of data. 

Although the author seems dubious of the value of inductive statistics to 
social workers, the two illustrations on page 161 are both examples of induc- 
tive statistics. 

In Chapter IX on the Mean, Median, and Mode it is not made clear that 
the “long” and “short” methods of computing the Mean produce identical 
results with the exception of rounding errors. In fact one gets the impression 
that the “short” method is an approximation to the results of the “long” 
method with the statement, “Thus the two methods produced substantially 
the same result” (p. 234). 

Chapter X is a 17 page chapter on the Geometric Mean and Logarithms. 
Considering the doubtful value of the geometric mean to social workers this 
space might well have been utilized further in the book in the treatment of 
statistics of relationship. 

In the chapter on Measures of Absolute Variability the author confuses 
the binomial and normal distributions, actually using a binomial distribution 
as his illustration on page 270 and calling it a normal curve. The binomial 
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distribution is not mentioned. In this same chapter eight pages are devoted 
to the mean deviation, a measure of limited usefulness, and only six pages 
are spent on the standard deviation, one of the most basic statistical meas- 
ures. It is also in this same chapter that reference is made to Appendix P, a 
four page appendix on how to extract a square root. At no point is a reference 
made to Barlow’s Tables of Squares, Cubes, Square Roots, Cube Roots, and 
Reciprocals. 

Chapter XIII deals with Ratio Background or semilogarithmic paper. This 
material really belongs in the early chapter on graphic presentation. 

Although the book deals primarily with descriptive statistics, Chapter XV 
is an 11 page chapter on Sampling which contains several errors sufficiently 
serious to deserve comment. No distinction is made between a systematic 
sample and a random sample, the formula for the estimated standard error 
of the mean does not include the loss of a degree of freedom for use of the 
sample standard deviation instead of the universe standard deviation, the 
t distribution is not even mentioned, and the probable error is used incor- 
rectly. On page 340 we find the statement, “The mean of the sample plus 
and minus the probable error defines the range within which half the sample 
means may be expected to fall.” Erroneous interpretations such as this are 
part of the reason for dropping the probable error from modern statistical 
usage. Confidence limits are not mentioned. 

In Chapter XVI on Time Series a “derivation” of the two normal equa- 
tions for getting the least squares straight line is given. The only difficulty is 
that what is given is not a derivation since the criterion of minimizing the 
sums of squares of deviations is not utilized and this characteristic of the 
least squares straight line is not mentioned until two pages later. 

Chapter XVII on Correlation and Contingency leaves so much to be de- 
sired that by itself it would make the book of very limited value even if there 
were no shortcomings elsewhere in it. The only method given for computing 
a correlation coefficient is by first determining the regression line, then com- 
puting Y, corresponding to each Y, finding the difference between Y and Y, 
and squaring these differences, then finding the difference between each Y 
and the mean of Y and squaring these differences, then from these two sets 
of squares of differences determining the standard error of estimate and the 
standard deviation, and utilizing these two measures to find the correlation. 
The procedure in itself would discourage most students from ever computing 
a correlation coefficient. No real understanding of the correlation coefficient 
is shown and no test of statistical significance of a correlation is given. In fact 
the author says, “Unfortunately, this cannot be done” (p. 386). His confusion 
seems to lie in not distinguishing between statistical significance and practical 
importance. 

The author gives two illustrations of high correlation between time series 
and suggests that the high relationship observed is due merely to chance, 
overlooking the fact that the correlation is due to the close relationship of 
each variable to a third variable, time. 
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As a suggestion for simplifying the computation of a correlation coefficient, 
the author suggests using the Spearman rank correlation as an estimate of 
the actual correlation between two variables and shows how this is com- 
puted. He gives no indication of the differences in assumptions and in utility 
of the two types of correlation coefficients. 

As a whole the book has a point of view of statistics which was fairly gen- 
eral about 30 years ago. A tangible illustration of this is the fact that the 
book at no point assumes that the student has access to a calculating ma- 
chine or even mentions a calculating machine. This is definitely not a modern 
statistics text and it is to be hoped that it will not set the tenor of statistical 
training among social workers. 


Punched Cards: Their Applications to Science and Industry. Robert S. Casey and 
James W. Perry, editors. New York: Reinhold Publishing Corporation, 1951. 
. Pp. viii, 506. $10.00. 


Harry P. HarTKEeMEIER, University of Missouri 


HIs book reveals the fact that chemists are now facing the same problem 

which confronted statisticians about 70 years ago. Statisticians trying to 
handle by hand the large volume of data collected in the 1880 U. S. Census 
took over seven years to organize and make available the information ob- 
tained. Chemists working now on the Gmelin Handbuch der anorganischen 
Chemie think that it will take 10 to 12 years to complete the work and they 
hope to complete the 8th edition “about 1960.” This work is taking so long 
that many chemists fear, and others are convinced, that the era of the classi- 
cal handbook is approaching its end. The parallel is striking. Statisticians in 
the ’80’s stated that the U. S. Census of 1890 would take more than 10 years 
to organize by hand and another census would have to be started before the 
information from the previous one could be made available, so some more 
rapid method of handling the data was imperative. This information was 
presented to some people who set about to invent and develop the Hollerith 
machines. Now, over half a century later, chemists are turning to the same 
punched-card machines to solve a similar problem—that of making available 
the pertinent information quickly, before it is made useless by the passage 
of time. 

An alternative to periodic handbooks is the mechanical information center. 
Various parts of this book point out that similar problems have arisen in 
other fields, such as searching U. 8. patent disclosures, hospital records, 
medical reports, library record cards, etc. Scientists have been forced to con- 
sider mechanical methods of searching indexes, and this book contains an 
excellent and clear discussion of the difficulties of using standard punched- 
card machines for searching such indexes. Although codes may be devised to 
classify books by size, date of publication, number of pages, etc., so that 
when a book is assigned to one class it is excluded from all others, a code for 
subjects presents a different problem, for the assignment of a book or article 
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to one subject class does not exclude it from ail other subject classes. To 
classify a book under two or more subject classes requires the use of two or 
more vertical card fields. This also usually requires that the indexer decide 
which is the major subject field and which are minor subject fields. A person 
searching for all books which deal with a given subject might want to examine 
a book even though it is indexed under a minor field. As the number of fields 
increases, the standard machine cannot search all of them at once, or on one 
run of the cards, for all books dealing with a certain subject, or search for all 
books dealing with a combination of subjects. 

A person, for example, who wants to locate all books dealing with the use 
of statistical methods to control the quality of a chemical compound used in 
a recently patented process to manufacture a new plastic would like to be 
able to locate all books dealing with this combination of subjects regardless 
of whether the book is primarily one on statistical analysis using this illustra- 
tion of quality control methods, or primarily a book on plastics which in- 
cludes a discussion of the fact that this particular plastic would not have been 
possible without uniform quality of a chemical compound obtained by using 
statistical methods of quality control, or even a book on patent laws that 
happened to contain an illustration of a court case arising out of the use of 
this plastic and involving the presentation of evidence including quality 
control charts. It would be nice if the machine would sort separately cards for 
books involving all of the subjects desired, all but one, all but two, etc. This 
book points out the need for machines that will accomplish such mechanical 
searches on one run of the cards and probably stimulated the research that 
resulted in the perfection of the machine described in Library Applications of 
Punched Cards by Ralph Parker (American Library Association, Chicago, 
1952). The use of horizontal fields instead of vertical fields permits the ma- 
chine to search 12 fields in succession on one run of the cards. Each of the 
12 subject fields will accommodate a binary code number equivalent to 10 
decimal places in the Dewey decimal system. 

This book contains the case histories of many punched-card applications 
and the editors are to be congratulated upon the completion of a considerable 
project involving many people in widely scattered locations. “The hand- 
sorted edge-punched cards are discussed in greater detail than the machine- 
sorted cards. In fact, one object has been to make the book serve as an oper- 
ating instruction manual for the edge-punched cards. It is not possible to do 
the same for machine-sorted cards within the scope of this book” (p. iii). 
The reviewer grants that this is the first and only book presenting detailed 
information on all hand-sorted edge-punched card systems, but he is also 
convinced that anyone adopting a hand-sorted system will find it to be only a 
temporary solution. Even for such prospective users the book may be dis- 
appointing because “there is no attempt to make critical comparisons” (p. 
39). In a book written by so many people in different locations it is very diffi- 
cult to avoid some duplication and a few statements that some may question. 
For example, “The only known correct way to sample is by the use of random 
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numbers” (p. 412), and “The mathematics of ‘goodness of fit’ have not yet 
been adequately investigated” (p. 413). 

This book is not very suitable as a college textbook but it should be in 
most libraries to provide supplementary assignments. Many scientists, in- 
formation specialists, executives, and office managers will find it useful as a 
reference. 


Population, Food, and Economic Progress. Merrill K. Bennett. Houston, Texas: 
The Rice Institute, July 1952. Pp. 68. Paper. 


Coun Cuark, The Bureau of Industry, Brisbane, Australia 


HIs pamphlet is a triumph of common sense. Such a world do we live in, 

that we are now beginning to regard common sense among learned men 
as one of the rarer virtues. A very well known professor of theoretical eco- 
nomics was recently heard to grumble that men of common sense never 
looked at statistics, and that men who studied statistics always seemed to 
leave their common sense behind them. 

Dr. Bennett’s work should now be well known, not only for his studies at 
the Food Research Institute of Stanford University, but also for his work in 
the field of international comparisons of levels of living, or “real income.” 
(American Economic Review, 1951). We have not, and will not have for many 
years to come, accurate information about income levels in the under-de- 
veloped countries, and the United Nations statistics of average income per 
head in these countries are not (speaking with all restraint) what they appear 
to be. Dr. Bennett devises a most ingenious statistical technique for combin- 
ing all the scattered scraps of information available for most of these coun- 
tries, such as the number of automobiles, telephones, high school students, 
and so on. The present reviewer had set out to measure the incomes of some 
of these countries quite differently, and the check with the Bennett method 
proved to be extremely satisfactory; so he is attempting to extend its sphere 
of application to more countries and to more recent years. 

One of Dr. Bennett’s principal figures is taken from FAO publications 
(which perhaps need no more qualification than they have received), namely 
calories of food consumption per head of population. He begins by reminding 
us of an obvious fact, the neglect of which has caused endless trouble— 
namely, that if statistics show that Europeans and North Americans con- 
sume 3,000 calories per head per day, and the inhabitants of southeast Asia 
2,000, the latter figure, per kilogram of average body weight, may be greater 
than the former. 

The next point which he makes is that the greater part of the world did 
and does obtain its calories from grains and roots, but that we all like to con- 
sume a substantial proportion of animal foodstuffs when we can afford them. 
In the past, this was the privilege of the wealthy on the one hand, of nomadic; : 
huntsmen and herders on the other; for obvious reasons, the latter category 
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now represents a far smaller proportion of the earth’s population than it did 
in the past. 

The next obvious fact which we tend to forget, and to which Dr. Bennett 
forcefully draws our attention, is that throughout the greater part of human 
history mankind must have had an inappreciable, if any, rate of population 
increase. In 1650, when modern population estimates begin, the world had a 
population of about five hundred millions. Biologists have contended that 
the human race has been in existence for five hundred thousand years, 
though some of them have recently raised the estimate to a million years— 
even if we assume that they. have a couple of zeros out of place, we are still 
left with the same problem. Whether we assume that the human race started 
with two people, as Christians believe, or take any larger number predicated 
for us by biologists, and whatever conceivable date we take as the starting 
point of the human race, we are still left with the conclusion that any rate of 
population increase even remotely comparable with those which we know 
now would have produced a far larger world population by 1650. This accords 
with the observations of anthropologists that primitive people produce large 
families, but that there is little or no net rate of population increase among 
them. Dr. Bennett has employed a team of historical students who follow 
world population data back to the year 1,000 A.D., and find plenty of evi- 
dence of stagnant or indeed declining populations, in all those historical 
periods when political order broke down. There is some evidence of heavy 
population declines, both in Europe and Asia, during the centuries of warfare 
and disorder between 300 and 800 A.D. 

The reviewer would, however, disagree with Dr. Bennett’s results on 
India. There, it appears, population rose to a maximum about the middle of 
the seventeenth century, under the despotic but ordered rule of the Mogul 
emperors; and in the ensuing two centuries of anarchy and warfare remained 
stationary or declined until order was again restored under British rule in the 
nineteenth century. 

The writings of Malthus, which have enjoyed such a comeback among 
American intellectuals in the present generation, are thus found to be a 
theoretical speculation not valid historically—nor, for that matter, did Mal- 
thus make a valid estimate of the possibilities of agricultural development. 
Human capacity to reproduce is not limited. Demographic studies of primi- 
tive peoples, showing an average of six or seven children born to a woman 
who lives to the end of her reproductive period, agree well with the estimates 
of the medical sub-commission of the recent British royal commission on 
population, as to what would be the total average fertility of the modern 
English woman, if she married early and imposed no restrictions on repro- 
duction; and also agree with recorded total fertility in some modern com- 
munities like Brazil, and sample areas in China. But under primitive condi- 
tions, or under the conditions of warfare and anarchy with which so much 
of the history of civilization is disfigured, this rate of reproduction will hardly 
allow the human race to maintain its numbers. Population increases only 
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occur where mankind can establish a stable political order—a surprisingly 
difficult thing to do—and at the same time, accumulate sufficient medical 
knowledge to control the ravages of disease—a combination only found in 
the modern world, except perhaps for some periods under the ancient Greeks 
and Egyptians. 

And when such order and increase have been established, it is surprising 
how short is the time interval—as both modern experience and ancient his- 
tory teach us—before mankind, of its own accord, starts in various ways to 
reduce its rate of reproduction. We know our own nineteenth century his- 
tory; in Japan and Russia we can trace, with reasonable accuracy, the fall in 
the average total fertility from six to four over the short period of the last 
thirty or forty years; and very few people are aware that a similar trend is 
occurring in India, where the rate of population increase is rapidly decelerat- 
ing. 

In his final and magnificently common sense conclusion, therefore, Dr. 
Bennett sees “a touch of hysteria” in “current attitudes towards what is 
called the population problem,” or the fashionable idea that America ought 
to send what might be called missionaries of contraception to inform the be- 
nighted Asiatics. Even in the modern world, areas of excessive population 
density are rare, and most of the habitable world is still uninhabited. War 
and anarchy in the past have not arisen in over-populated regions—rather 
the reverse, among the people of sparsely settled grazing areas, who wished 
to live on the labor of others, rather than cultivate land which was available 
for settlement. Calculations of the remote future at which the world, at its 
present rate of increase, will become finally overcrowded, are of little interest 
—a purely arithmetic exercise, and about as sterile as the favorite old compu- 
tation of what would happen if a man left one thousand dollars to be in- 
vested at compound interest for the benefit of his descendant in six hundred 
years time. 


Census of Manufactures: 1947: Indexes of Production. Bureau of Census and 
Board of Governors of the Federal Reserve System. U. S. Government Printing 
Office, 1952. Pp. viii, 99. $1.75. 


Paut B. Simpson, University of Oregon 


5 pry volume explains the material, methods, and results of computation of 
of an index of manufacturing based on the 1947 and 1939 Census of 
Manufactures. It extends the work of Solomon Fabricant: The Output of 
Manufacturing Industries, 1899-1937 (National Bureau of Economic Re- 
search, New York, 1940), both in years covered and methods employed. The 
bulk of the work comprises detailed tables of quantity and value of output, 
industry indexes of output, employment, and weights. The text explains 
methods used, and the reasons therefor. 

The final index for 1947 relative to 1939 is 174, indicating that this eight- 
year period witnessed a record percentage rate of manufacturing growth in 
the United States. The percentage increase from 1921 to 1929 was somewhat 
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more, namely 88 as against 74, but this was accomplished during passage 
from a depressed year, 1921, to a peak year, 1929, whereas 1939 was a near 
record year of industrial production. Jim Corbett fans may be interested to 
know that in the ten-year period 1899 to 1909, manufacturing increased 56 
per cent. The growth since pre-war years has been large by nearly any stand- 
ard. 

The greatest task facing any index computor is obtaining data comparable 
over time. The Census and Federal Reserve Board collaborators have done a 
remarkable job in obtaining as much such data as they have. The work in- 
volved in such an undertaking may be illustrated with a random quotation: 


Census figures for “other natural cheese” in 1947 represent “shipments and 
interplant transfers.” These figures are not comparable with 1939 “produc- 
tion” statistics, since they exclude natural cheese made into process cheese. 
.. + Hence a United States Department of Agriculture quantity figure, 
which includes this production was used for 1947” (p. 77). 


It requires a special talent of patience, imagination, and devotion to ac- 
curacy to ferret out such detailed information. The statistical world and the 
nation owes a debt to those who have repeated the task a thousand times 
over in preparing this index and other economic series. The quotation also 
serves to remind us that pretentious economic categories such as supply, 
marginal efficiency of capital, and gross national product are compoundings 
of matters no more mysterious than cheese. 

The Census Bureau could confine its work to collecting information and 
turning it over to the public for analysis. The present volume is ample testi- 
mony that further stages of analysis are desirable. The comparison of 1947 
and 1939 data was accomplished better on home grounds than it could have 
been elsewhere. Indeed, this reviewer believes that there would be a net gain 
if the scope of the index construction activity had been expanded still further. 
Measurement cannot be separated from economic analysis, and the closest 
integration is desirable. Below are some examples of points where the com- 
putation of the index raises problems of general economic analysis. 

The index formula used is a cross-weight formula, the Marshali-Edgeworth 
formula using 1939 and 1947 value-added-per-unit weights. With simple 1939 
weights, the index would have been 184, and with 1947 weights, the index 
would have been 169. This difference indicates that quantity of production 
increased most for those items whose prices increased least. In supply and 
demand terms, this says that for those items whose demand schedules in- 
creased the most, supply schedules increased proportionately more. If we 
look not at the total manufacturing index but at industry groups, a related 
tendency is manifest. The difference between group indexes based on the 
two weights is greatest where production increased the most, the 1947 
weights yielding a lower index. Thus leather products, whose physical pro- 
duction increased only 15 per cent from 1939 to 1947, had commensurate 
price increases, with the result that the two weight systems yielded the same 
index. On the other hand non-electrical machinery increased in production 
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177 per cent from 1939 to 1947 as measured by the 1939 weight index, and by 
160 per cent as measured by the 1947 weight index. The Spearman coefficient 
of rank correlation of 1939 indexes and the difference of 1947 and 1939 weight 
indexes for nineteen groups is .74. Assume that for those groups where growth 
was greatest, the dispersion of the amounts of growths of individual indus- 
tries is largest. This assumption, similar to one frequently made in stratified 
sampling, is reasonable here. Using this assumption, we reach the conclusion 
that within most groups of industries, as in the total manufacturing group, 
supply reacted proportionately more strongly to larger increases in demand. 

Let us suppose provisionally that the explanation of these facts lies in 
economies of large scale production. Such economies would presume down- 
ward sloping long-run supply curves, and would become operative exactly in 
those industries whose capital expansion was the greatest, and those would 
likely be industries where demands increased most. The observed facts would 
be explained nicely. We now have a new understanding of the different 
meanings of the indexes based on 1939 and 1947 weights respectively. The 
1939 weight index approximates the number of years it would have taken 
the 1939 industry, with its cost structure, to produce the 1947 output. The 
1947 weight index is the reciprocal of the fraction of a year that it would 
have taken the 1947 industry to produce the 1939 output. The cross weight 
index is a measure based on the operation of some intermediately adjusted 
productive system. The problem of weighting methods has found solution in 
terms of economic meaning. 

It may be objected that the suggested explanation may not be the correct 
one. There are indeed other possible explanations of the response of supply to 
demand. Monopoly and wage rate effects are possibilities. Another is the 
observable fact that two groups of industries showing less than average 
growth in volume of output, namely petroleum and coal products and lumber 
and lumber products, are closely tied to natural resources. Were such re- 
source limitations the cause of the low output expansion and high price reac- 
tions? To what extent do measures of manufacturing production reflect such 
limitations? A discussion of the questions rising from theory and measure- 
ment would be very illuminating. 

Another aspect of the weighting problem is its relation to national income 
measures. Value added by manufacture is the basis for desirable weighting, 
because it eliminates duplication in value of commodities. However, from the 
standpoint of the economy as a whole, duplication may still arise, because 
taxes, advertising, insurance, professional services and the like are repre- 
sented in value added by manufacture, though they have separate represen- 
tation in national income. The United Nations Statistical Office has recom- 
mended the use of weights excluding such economic activities. The writers of 
the work in review have handled the problem only by noting that prepara- 
tion of such weights, others than those based on value added by manufac- 
ture, “was not found feasible” (p. 3). Surely the question of the position of 
the manufactures index in all output deserves more consideration than this. 
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It is worth noting in this connection that the differences of national income 
and value added by manufacture may be just as important as changes in 
value added by manufacture per unit of output in time. Total value added 
by manufacture in 1947 was $74,428 billions, of which 12.2 per cent or 
$9,053 billion was due to food production. National income originating in 
manufacturing, according to Department of Commerce estimates, was 
$59,459 billions in 1947 of which 9.8 per cent, or $5,822 billions, were al- 
located to food products. The differences demonstrate that this weighting 
question warrants careful analysis. Also it would be desirable to compare 
the production indexes with other value and price information than Census 
data, which information might have been used for independent testing of the 
consistency of different source data. 

The definition of physical output as value of output at constant prices 
would work well if prices stayed constant, if the physical nature of goods 
remained unchanged, and if the definition is significant to economic analysis. 
All three, however, are questionable. Quality changes particularly embarrass 
the authors. They observe: “Technological change may make possible the 
substitution of less expensive materials without affecting the quality of the 
product” (p. 7), and again, “The (quality) changes are difficult if not im- 
possible to measure quantitatively, and the failure to reflect them in the 
present index, results in a downward bias” (p. 7). These two statements 
imply that a product is defined as a capacity for doing a particular job in 
production functions and consumer preference functions. If this is to be our 
definition of output, let us have it in the open and apply it the best we may. 
(It will lead to surprising results in many cases such as rubber tires, and 
nylon socks.) It also appears at conflict with a weight selection which reflects 
cost conditions in different industries. Economic theory has various defini- 
tions of output. These should not be ignored. 

Questions of economic analysis arise in connection with the otherwise ex- 
cellent studies of representation of the “missing” industries, that is, in- 
dustries for which physical production data of any sort are largely missing, 
covering about one-fourth of all industries in terms of value added by 
manufacture. The selection of a method for measuring output in these in- 
dustries was based on studies of what method would have worked best in 
industries where quantity data are available, if they had not been available. 
A critical test was to determine which of three artificially constructed indexes 
came closest to the originally computed index. The assumptions used in 
constructing these artificial indexes were: (1) temporal change in value of 
output per unit of physical product was similar among related industries, 
(2) temporal change in value added by manufacture per unit of physical 
product was similar among related industries, and (3) the temporal change 
in output per man employed was similar among industries. The winner in 
this interesting contest was number three, which yielded the highest index 
for 1947 by some six points, or three per cent in the case of the output as- 
sumption (1), and by half as much for the value added assumption (2). 
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Economic theory poses some questions about this selection of method. The 
“missing” industries are particularly prominent in machinery industries, 
fabricated metal products, and miscellaneous /«dustries, which are industries 
whose production has grown particularly fast. This fact suggests a question 
of the following kind: Is the price behavior of these industries to be ex- 
plained by inefficiencies arising from dynamic change, by increasing effi- 
ciencies due to economies of scale, or by wage rate developments? So far as 
we can judge by price changes, the answer would appear to be that the 
second possibility is most likely, since generally price rises were less in the 
more expanding industries. This suggests, in turn, that output per man rose 
more in the “missing” industries than in others, suggesting, in turn, that the 
output per man selection may have understated the index, though by less 
than the other two alternatives considered. A discussion of these theoretical 
questions would be illuminating both to economic analysis and to measure- 
ment. 

The reviewer believes that theory or explanation is inseparable from 
measurement. At the moment, measurement is ahead of theory in the sense 
that we have measures such as the index of manufacturing whose significance 
is not fully understood because we do not know the underlying forces ac- 
counting for the particular observed changes. Measurement would be im- 
proved if answers were sought to specific questions of the following nature: 

(1) How long would it have taken the productive mechanism of 1939 
(say) to produce the output of 1947, and conversely with interchange of 
dates? 

(2) To what extent was the change in output accomplished by changes in 
employment, changes in technology, and economies of scale? 

(3) What was the output of consumers’ goods, in the sense of selling values 
at constant prices with allowance for quality changes and in the sense of 
production requirements? What was the investment output in the sense 
of production requirements and in the sense of capacity for new production? 

(4) What are the indexes of output and employment in national income 
and gross national product senses? 

(5) How well do the indexes reconcile with price, value of shipment, na- 
tional income and other independent data? 

These questions cannot be answered easily or quickly, but beginnings can 
be made at once. Perhaps a committee of learned societies is in order for the 
purpose of making recommendations regarding expanded activities of gov- 
ernment statistical workers. 


Washington State Statistical Abstract. Marilyn Druck Robinson. Seattle, Wash- 
ington: University of Washington Press, 1952. Pp. xi, 159. $4.50. Paper. 


Paut B. Simpson, University of Oregon 


ber volume brings together a large quantity of data, covering prin- 
cipally area, population, employment, payrolls, production, trade, 
transportation, income, prices, and finance. The state of Washington nat- 
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urally receives the most attention, but comparative data for the United 
States and for western states are frequently presented. 

Although this volume uses data gleaned from census and other national 
sources as its chief stock in trade, it is far from a mere compendium of such 
material. The reviewer estimates that at least one-fourth of the 104 tables 
contain data not available * U,: standard sources. Examples of information 
hard to come by are trade and income statistics based on Washington tax 
information, indexes of business activity in the Northwest, housing- and 
rental-survey information, family income-survey information, Washington 
data about utilities and motor carriers, and lumber shipments reported by 
the Pacific Lumber Inspection Bureau. Other tables give inter-census inter- 
polations and county and regional break-downs of special information not 
available in standard sources. Payrolls, births and deaths, population, labor 
force, income, and the mineral statistics are examples. 

The work is carefully documented as to sources, nature of information, and 
additional sources, and includes warnings about misusing the data. By care- 
ful study the researcher can obtain a fairly complete picture of what regional 
information is available from all sources. There can never be too much ex- 
planation, however. In using the volume this reviewer found himself wishing 
that it included a summary of the idiosyncrasies of the Washington State 
retail and industrial income tax laws and of covered employment under the 
Social Security Program. A separate list of Washington State reports and 
local source materials would also have been useful to give the research 
worker a handy reference to sources of local data. 

Very few similar volumes exist for other states. The Universities of Ala- 
bama and of Mississippi have compiled statistical abstracts for their states, 
and a certain amount of statistical information is included in the state blue 
books. Generally, regional data are difficult to locate and to obtain. Marilyn 
Robinson and the other members of the Bureau of Business Research of the 
University of Washington who prepared the abstract have made a valuable 
contribution to regional research. 
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